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FOREWORD 


These  Transactions  preserve  in  print  most  of  the  invited  addresses 
and  contributed  papers  of  the  Twenty-first  Conference  of  Army  Mathemati- 
cians. These  meetings  are  sponsored  by  the  Army  Mathematics  Steering 
Committee  (AMSC)  on  behalf  of  the  Office  of  the  Chief  of  Research,  Develop- 
ment and  Acquisition.  Members  of  this  Committee  request  that  the  guest 
lecterers  be  effective  researchers  who  are  in  frontier  fields  of  live 
current  Interest.  They  feel  that  the  addresses  by  the  invited  speakers 
as  well  as  contributed  papers  by  Army  personnel  will  stimulate  the  inter- 
changes of  ideas  among  the  scientists  attending  said  meetings. 

In  June  of  1973,  Messrs.  W.  A.  McCool  and  William  Sheperd  of  the 
White  Sands  Missile  Range  called  the  Army  Research  Office  (ARO)  to  inquire 
about  the  possibility  of  holding  either  the  Twentieth  or  the  Twenty-first 
Conference  of  Army  Mathematicians  at  their  installation.  Since  the  next, 
the  Twentieth  Conference,  had  been  scheduled  for  the  US  Army  Natick  Labor- 
aotires,  these  gentlemen  were  thanked  and  advised  that  a written  offer  to 
host  the  1975  meeting  would  be  appreciated.  Dr.  Richard  H.  Duncan,  Tech- 
nical Director  and  Chief  Scientist  of  the  White  Sands  Missile  Range,  in  a 
letter  dated  10  May  1974  issued  a formal  invitation  to  hold  this  meeting 
at  his  installation.  Part  of  his  letter  is  quoted  below: 

I cordially  invite  the  Applied  Mathematics  Subcommittee,  US  Army 
Mathematics  Steering  Committee,  to  hold  the  Twenty-First  Confer- 
ence of  Army  Mathematicians  at  the  US  Army  White  Sands  Missile 
Range,  NM,  in  May  1975.  We  would  be  pleased  to  serve  as  host  to 
the  conference. 


i i i 


I believe  a conference  held  here  would  attract  a number  of 
Western  Army  Mathematicians  who  do  not  normally  attend  the 
conferences.  Their  attendance  and  participation  would  be 
of  substantial  benefit  to  the  Army  installations  in  our 
area. 

If  the  conference  is  held  here,  Mr.  P.  J.  Higgins,  Chief  of  the 
Analysis  and  Computation  Division,  National  Range  Operations 
Directorate,  will  be  in  charge  of  local  arrangements.  Please 
contact  him  for  additional  information. 

Over  115  Army  and  academic  scientists  attended  this  14-16  May  1975 
conference,  which  convened  in  El  Paso,  Texas.  It  Is  pleasing  to  note 
that  over  hald  of  these  individuals  were  from  the  host  Installation.  On 
the  afternoon  of  the  second  day  of  the  meeting,  Mr.  P.  J.  Higgins,  the 
Chairman  on  Local  Arrangments,  arranged  a trip  to  the  White  Sands  Missile 
Range  and  a very  interesting  tour  of  that  base.  Those  in  attendance  are 
indebted  to  him  and  members  of  his  staff  for  this  three  day  meeting. 


The  success  of  this  conference  was  due  to  many  persons,  including 
the  active  and  enthusiastic  members  of  the  audience,  the  chairmen  of  the 
sessions,  and  authors  of  the  many  papers.  The  members  of  the  AMSC  would 
like  to  thank  these  gentlement  for  taking  time  to  prepare  papers  for 
these  Transactions  so  that  many  persons  unable  to  attend  this  symposium 
can  profit  by  their  contributions  to  the  scientific  literature. 
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ON  SPECTRALLY  BOUNDED  SIGNED  GRAPHS* 


A.  J.  Hoffman 

IBM  Thomas  J,  Watson  Research  Center 
Yorktown  Heights,  New  York  10598 


ABSTRACT;  This  is  a summary  of  part  of  the  lecture. 


We  consider  two  questions  concerning  symmetric  (0,1,-1)  matrices 
A with  0 diagonal  and  least  eigenvalues  AX(A):  (1)  can  we  determine 

(roughly)  | A.  (A) | by  the  largest  ojder  of  principal  submatrices  of  certain 
types?  (2)  can  we  approximate  A- A (A) I by  the  gramraian  of  a (0,1,-1) 
matrix? 


We  shall  consider  the  class  of  all  signed  graphs  G(i.e.,  graphs 
in  which  any  pair  of  adjacent  vertices  are  joined  by  a positive  edge  or 
negative  edge),  and  their  associated  adjacency  matrices  A = A(G)  defined 
by 


/ 0 if  i and  j are  non-adjacent  vertices  (thus 
J a..  = 0 for  all  i) 

3-  r H 

ij  = \ 1 if  i and  j are  joined  by  a positive  edge 

l-l  if  i and  j are  joined  by  a negative  edge 

If  A^is  anjj  symmetric  matrix,  we  denote  its  eigenvalues  in  ascending  order 
by  A (A)<A  (A)  < . . . . If  G and  H are  signed  graphs,  we  say  G<=H  (G  is  an 
induced  subgraph  of  H)  if  A(G)  is  a principal  submatrix  of  H. 

The  first  question  we  consider,  in^analogy  to  [3]  for  ordinary 
graphs,  is  whether  the  magnitude  of  A (G)=A  (A(G))  can  be  in  anyway  re- 
lated  to  the  size  of  certain  induced  subgraphs  of  G,  To  explain  the  ques- 
tion (and  the  answer),  we  must  first  introduce  some  notation: 

A large  empty  circle  O will  denote  an  independent  set  of  t vertices, 
A large  circle  with  a + inside  (T)  will  denote  a set  of  t vertices,  every 
pair  of  which  are  joined  by  a positive  edge;  a large  circle  with  a - inside 
- will  denote  a set  of  t vertices,  every  pair  of  which  is  joined  by  a 
negative  edge.  The  symbol 


means  that  a single  vertex  is  joined  by  a positive  edge  to  each  of  t vertices, 
whose  relation  to  each  other  is  specified  by  ? (All  4-  all  - or  independent). 
The  meaning  of 


T: 


© 


should  be  clear. 

* This  work  was  supported  (in  part)  by  the  Army  Research  Office  under 
contract  number  DAHC04-74-C-0007 . 
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Finally, 

t:  ® + <2> 

means  that  each  of  the  left  hand  set  of  t vertices  is  joined  by  a positive 
edge  to  each  of  the  right  hand  set  of  t vertices;  and  the  symbol 

t:  © __  0 is  similarly  defined. 


Consider  now  the  following  graphs: 


(1) 

t : 

© 

+ 

(2) 

t: 

_o 

(3) 

t : 

-O 

■f 

+ 

(4) 

t : 

© 

© 

+ 

(5) 

t : 

0 

_ 

+ 

(6) 

t : 

© 

0 

(7) 

t : 

© 

- 

© 
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We  define,  for  each  graph  G,  t(G)  to  be  the  smallest  integer  t 
such  that  none  of  (1)-(1Q)  is  an  induced  subgraph  of  G, 

One  copy  can  readily  prove  that  t(G)  cannot  be  large  if  |x^(G)| 
is  of  modest  size*  The  reason  is  that,  if  H is  any  one  of  (1)-(10), 

i i f 

A as  But  H^cG  implies  A (H  )>A  (G) . The  point  of  interest 

is  the  converse.  If  t(G)  is  modest,  so  is  |A^(G)|  in  the  following^sense : 
Theorem  1 . Each  of  the  two  functions  of  signed  graphs,  t(G)  and  |AX(G)| 
is  bounded  by  a function  of  the  other. 

We  will  not  be  specific  about  these  functions  (here  and  in  the 
theorems  stated  below) , since  the  functions  produced  by  the  proofs  (which 
depend  heavily  on  the  use  of  Ramsey’s  theorem  and  its  relatives  [5])  are 
enormous,  and  manifestly  very  far  from  being  accurate.  The  points  to  be 
emphasized  are 

(i)  the  functions  are  functions  of  one  variable  only,  and  in  particular 
do  not  depend  on  the  number  of  vertices;  and 

(ii)  it  would  be  of  interest  to  find  better  estimates  of  these  functions, 
closer  to  the  truth. 

The  second  question  to  be  considered,  inspired  by  [1],  is  the 

following*  Let  G be  a signed  graph,  then  how  can  we  represent  the  positive 

1 T 1 

semi-definite  matrix  A(G)-A  (G)I  by  a grammian  KK  ? In  case  A (G)=-2, 

the  question  has  a nice  geometric  interpretation.  The  rows  of  K are 

vectors  of  length  and  the  inner  products  of  the  rows  of  K are 

Hence,  the  question  is  equivalent  to  finding  ways  of  placing  vectors  in 

Euclidean  space  so  that  any  two  make  an  angle  of  0 ,60  , or  120  , From 

the  theory  of  root  systems  in  Lie  groups  [2],  one  can  infer  that,  if  G 

is  connected,  then  there  exists  a (0,1,-1)  K satisfying 

(11)  A(G)-X1(G)I  = KKT 

with  finitely  many  exceptions. 

We  jre  led  to  conjecture  that  (11)  might  hold  for  other  integral 
values  of  A (G) . 

Conjecture  1.  If  A^(G)  is  integral  and  G is  connected,  then, 
with  finitely  many  exceptions  there  exists  a (1,-1 ,0)  matrix  K such  that 
(11)  holds. 

It  is  easy  [6]  to  show  that  Conjecture  1 is  true  (with  no  ex- 
ceptions) if  A (G)  - -1.  But  conjecture  1 is  false  for  all  other  integral 
A (G)  - “3,-4,...  . The  counter-examples  are  constructed  inductively  as 
follows.  Find  a G such  that  A (G)  - -r,  and  (11)  does  not  hold  for  that 
G(this  can  be  done  for  r = 2).  Make  up  a graph  H consisting  of  an  enormous 
number  of  copies  of  G (enormous  depends  on  r) , with  all  instances  of  vertex 
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i of  G (|or  all  i)  joined  by  a positive  edge.  Then  (11)  will  be  false  for 
H,  and  A (H)  - -(r+1).  There  will  be  an  infinite  number  of  such  H,  since 
we  can  take  an  arbitrarily  large  number  of  copies  of  G to  make  H,  Now  the 
induction  continues,  using  only  one  of  the  H in  place  of  G. 

There  is  another  reasonable  conjecture  (whether  or  not  A^(G)  is 
an  integer) : 

Conjecture  2 » There  exists  a constant  C such  that,  for  each 
signed  graph  G,  there  exists  a (1,-1, 0)  matrix  K with 

(12)  | | (A(G)  - X1(G)X)  ~KKT||<C. 


Here*  | | | | stands  for  the  usual  operator  norm* 

Conjecture  2 is  also  f^lse,  by  the  following  example.  Assume  C 
given*  Choose  an  integer  m>C+^.  Choose  an  integer  n so  that 

/ 4m^  + hrv  - 2n<l,  and 

2 

n -C  1 
n m+n+C  2 
/ \ 

Now  construct  a graph  G^p  as  follows.  Begin  with  a claw  K Denote 

2 2 1 n^ 

the  n vertices  of  the  claw  by  l,...,n  . In  addition,  constrict  2m 

(positive)  cliques,  each  of  size  p,  adjacent  to  1;  2m  positive  cliques, 

each  of  size  g,  adjacent  to  2;.,,;  2m  positive  cliques,  each  of  size  p, 

adjacent  to  n . It  can  be  shown  that,  for  p large  enough,  (12)  will  be 

false. 


It  turns^out,  however,  that  a weak  version  of  (12)  is  true,  with 
C depending  on  A . In  fact,  we  have  the  following  results,  improving 
[3],  and  using  methods  of  [3]  and  [4]. 

Theorem  2t  There  exists  a fixed  function  f(x)  such  that,  for 
every  signed  graph  G,  there  exists  a (+1,-1, 0)  matrix  K such  that 

(13)  | | (A(G)  - X1(G)I)  - KKT | j <f (X1(A) ) - 

Further,  the  number  of  nonzero  entries  in  each  row  of  K is 

min( [-A^ (G) , - A^(abs  G) ] ) • 

(Here,  [x]  is  the  greatest  integer  at  most  x,  abs  G is  the  ordinary  graph 
obtained  from  G by  making  all  edges  positive) . 

Theorem  3*  There  exists  a fixed  function  f (x)  such  that,  for 
every  signed  graph  G,  there  exists  a (+1,-1,0)  matrix  K,  every  row  of  K 
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containing  min  ([-X^CG)],  [-A"^(abs  G)])  nonzero  entries,  so  that  (13)  and 
(14)  ||  A(abs  G)  - A1 (abs  G)l  - (abs  K)  (abs  K)T| | <f (A1(A)) 


hold.  (Here,  abs  K is  obtained  from  K by  replacing  each  -1  by  +1). 
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EFFECTS  OF  CONCENTRATED  MASS  AND  THRUST  DIRECTIONAL  CONTROL 
ON  THE  STABILITY  OF  FREE-FREE  BEAMS 


Julian  J.  Wu 
Benet  Weapons  Laboratory 
Watervliet  Arsenal 
Watervliet,  New  York  12189 


ABSTRACT.  The  application  of  the  finite  element  technique  to  non 
self-adjoint,  more-than- two-points  boundary  value  problems  was  demonstrated 
in  a recent  paper  on  missiles'  stability  analysis.  Numerical  results  are 
limited  there.  However,  they  have  brought  new  understanding  on  the  basic 
stability  behaviors  of  a free-free  beam.  The  effect  on  such  behaviors  due 
to  a concentrated  mass,  modeling  a piece  of  heavy  machinery,  was  also 
shown  for  some  special  cases. 

The  present  paper  is  an  extension  of  the  previous  work  to  include  the 
location  as  well  as  the  amount  of  the  concentrated  mass  as  parameters. 

Also  included  is  an  extensive  study  of  the  thrust  directional  control 
feedback  and  its  effect  on  the  stability  behavior's  of  free-free  beams. 

1.  INTRODUCTION.  The  study  of  nonconservative  stability  of 
structures  without  fixed  supports*  has  turned  out  many  surprises.  Although 
extremely  similar  to  those  with  fixed  supports  in  terms  of  the  governing 
equations,  the  so-called  "free"  structures  present  difficulties  not  only 
in  methods  of  solutions  but  in  the  interpretation  of  the  solutions  them- 
selves. Many  basic  features  of  the  problem  were  not  sufficiently  exploited 
in  the  literature  dealing  with  "free"  structures  [1-8] . As  Solarz  pointed 
out  in  conjunction  with  a "free-free"  beam  problem  [6] , that  the  assumption 
of  a fixed  end  could  remove  some  important  features  of  the  structural 
motion  as  a "rigid  body"  with  its  vibrations.  No  adequate  explanations  were 
given,  however.  And  to  the  best  of  our  information,  none  was  found  else- 
where. 

Let  us  ask  a trivial  question,  for  example:  How  many  zero  eigenvalues 

are  there  associated  with  the  vibration  of  a uniform,  free-free  beam? 

The  answer  is  two;  and  this  can  be  easily  shown  by  going  through  the 
process  of  the  separation  of  variables  for  partial  differential  equations. 
However,  in  some  of  the  most  well-known  text  (i.e.  [9,  10]),  only  one  zero 
eigenvalue  was  reported.  Probably  such  an  inaccuracy  is  of  no  consequence 
for  the  cited  example  per  se,  except  for  the  case  of  the  same  beam  but  now 


*As  an  initial  step,  Only  beam  structures  are  considered  since  they  are  the 
simplest  possible  continuous  systems. 
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also  subjected  to  an  initial  axial  force.  Since  then,  the  realization 
of  the  correct  number  of  zero  eigenvalues  of  the  first  problem  is 
essential  to  the  proper  interpretation  of  the  stability  data  of  the  second* 

A numerical  analysis  of  such  problems  and  some  other  points  have  already 
reported  by  the  present  writer  [11,  12]. 

The  purpose  of  the  present  paper  is  to  include  some  additional 
computational  results  on  the  stability  behavior  of  a free-free  beam.  In 
particular,  the  effect  due  to  the  amount  and  location  of  a concentrated 
mass  and  the  amount  and  the  sign  of  the  directional  control  parameter  are 
analyzed. 

2.  PROBLEM  FORMULATION.  Let  us  consider  the  plane  motion  of  a 
uniform  missile  attached  with  a concentrated  mass  travelling  under  a 
constant  acceleration  in  the  axial  direction.  The  only  forces  are  the 
thrust  P and  the  inertia  force  (pA£+M)a,  where  p = density  of  the  material, 

A = area  of  cross-section,  l = length  of  the  missile,  M = amount  of  the 
concentrated  mass  and  a = axial  acceleration. 

Using  the  notion  well  described  by  Bolotin  [13],  we  shall  first  discuss 
the  undisturbed  form  of  equilibrium.  The  stability  of  such  an  equilibrium 
will  then  be  determined  by  an  investigation  of  the  disturbed  motion,  which 
is  the  prime  concern  of  our  analysis. 

The  undisturbed  state  (Figure  1)  is  governed  by  three  equations: 


(la) 

Z F. 

(lb) 

Z F, 

(lc) 

Z M 

Equations  (1)  states  that  the  total  force  in  X-direction,  in  Y-direction  and 
the  total  moment  in  X-Y  plane  are  zero.  From  equation  (la),  we  have 

(2a)  P = (pA£+M)a 

or 

t2W  a“  pM 


Thus  the  initial  force  acting  on  the  cross-section  of  the  beam  is  given 
by 
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(3) 


T(x)  = i 


' (pAx)a  = j (— ^-  ),  95.x<xin 
& 1+m 

(pAx+M)a  = (■ | + m)  (—) , xjn<x<Jl 

where  x is  measured  from  the  tip  of  the  beam,  m = and  x,.  denotes  the 

pAJt  111 

location  of  the  concentrated  mass.  Equation  (lb)  is  always  satisfied 
because  there  is  no  force  acting  in  Y direction.  So  is  equation  (lc)  due 
to  the  fact  that  the  thrust  and  the  inertia  force  are  colinear  in  the 
undisturbed  state. 

The  disturbed  motion  will  be  defined  by  a small  lateral  displacement 
u(x,t),  perpendicular  to  the  undisturbed  (rectilinear)  axis  (Figure  2).  As 
a means  to  manipulate  the  stability  behavior,  we  assume  that  the  direction 
of  P can  be  rotated  through  a small  angle  0 about  the  tangent  at  the  tail 
end  of  the  disturbed  beams  such  that 

(4)  6 a *e  u'  (xQ , t) 

where  Kq  is  a nondimensional  design  constant,  X0  denotes  the  location  of  the 
direction  control  sensor.  The  only  limitation  on  Kq  is  that  6 be  small  so 
that  the  linearized  equations  of  disturbance  are  valid.  These  equations 
consist  of  the  following: 


(5a) 

D.  E.  EIu 

""  + [T(x)u*]'  + pAii  + MiKxjj)  <5(x-ym)  = 0 

(5b, c) 

B . C . u * * 

(0)  = 0,  u'  " (0)  = 0 

(5d) 

uf ' 

(*)  - o 

(5e) 

EIu 

• 1 • (*)  = P 0 

= PK0u' (x0) 

where  E = Young's  modulus,  I * second  moment  of  the  cross-section  and  6(x)  is 
the  Dirac  function.  T(x)  is  given  by  equation  (3)  and  0,  by  (4).  A prime 
(')  denotes  a differentiation  with  respect  to  the  spatial  coordinate  x and 
a dot  (*)»  a differentiation  with  respect  to  time  t.  Equations  (5)  can  be 
conveniently  derived  by  integrating  the  three-dimensional  equations  for  the 
disturbed  motion  given  in  reference  [14]. 

It  will  be  convenient  to  write  equations  (S)  in  terms  of  dimensionless 
quantities.  Thus, 

(6a)  D.E.  u"  ' 1 + (f(x)u')  ' + A^u  + mA^u(xm)  6(x-xm)  - 0 

(6b, c)  B.C.  u"(0)  = 0,  u"'(0)  = 0 
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(6d)  u"(l)  = 0 

(6e)  u'"(l)  = KeQu'(xe)  = 0 

where  f(x)  in  (6a)  is  given  by 


(7) 


f(x) 


Qlx » 0 £ x £ Xm 

QjX+mQj,  Xjh  < x < 1 


Q:  * Q/ (1+m)  and  Q = P£2/(EI). 


The  spatial  variables  are  made  dimensionless  through  a division  by  JL 
The  time  t is  made  dimensionless  through  a division  by  the  constant 
c = (pAS,4/EI)l/2,  which  is  in  real  time  unit  (seconds,  for  example).  In 
equations  (6),  we  have  also  assumed  that  the  solution  of  u(x,t)  is 
exponential  in  time,  i.e., 


(8)  u(x,t)  = ufxje^. 

There  should  be  no  confusion  from  using  the  same  letter  u for  two  different 
functions  u(x,t)  and  u(x). 

3.  BASIS  OF  SOLUTIONS  AND  THE  ADJOINT  PROBLEM.  Numerical  solutions 
to  the  equations  of  the  disturbed  motion  (equations  (6))  will  be  obtained 
through  the  finite  element  technique  - adjoint  variational  formulations. 

The  basis  of  this  formulation  is  given  here.  Through  integrations-by-parts, 
it  is  straightforward  to  show  that  the  adjoint  problem  of  equations  (6)  and 
the  associated  variational  statement  are  given  in  equations  (9)  and  (10) 
respectively. 


(9a) 

D.E. 

v,,M  + [f(x)v']‘  + X2v  + mA2  v(x]n)  6(x-xm) 
- QK0  v(l)  6' (x-x0)  = 0 

(9b, c) 

B.C. 

v' ' (0)  =0,  v"  ' (0)  = 0 

(9d) 

v*'(l)  + Qv (1)  = 0 

(9e) 

v'*'(l)  + Q v*(l)  = 0 

and 

(10a) 

61  = 0 

1 

1 = / 
o 

[u'  *v*  ’ -f  (x)u,v'+A2uv+mA2u(xjn)v(xm)6(x-xm)]dx 

(10b) 

+ Qu' 

(l)6v(l)  + QKeu' (xq)v (1) 
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where  v(x)  is  the  adjoint  field  variable  and  f(x)  is  given  in  Eq.  (7). 

It  is  worthwhile  to  note  that  the  boundary  conditions  of  Eqs.  (6) 
and  (9)  are  all  natural  boundary  conditions.  Thus  the  variational  statement 
(10)  is  completely  unconstrained.  In  other  words,  the  coordinate  functions 
used  for  Rayleigh-Ritz  approximation  need  not  to  satisfy  any  of  the 
boundary  conditions  [15]. 

We  further  note  that  Eq.  (6)  is  not  exactly  a two  point  boundary  value 
problem  unless  Xa  = 1.  It  is  clear  that  the  boundary  condition  of  the  type 
of  Eq.  (6e)  in  which  xq  * 1 does  not  present  any  difficulty  when  the 
variational  statement  (10)  is  used  for  finite  element  solutions. 

For  the  special  case  Xg  = 1,  the  adjoint  problem  of  Eqs.  (9)  can  be 
written  in  a slightly  different  form: 


(lla)  D.E. 

v"  ' * + [f  (x)v']  ,+X2v+mX2v(xm)6(x-xm)  = 0 

(llb,c)  B.C. 

v' ' (0)  = v'  " (0)  = 0 

(lid) 

v"(l)  + Qd+Kg)  v ( 1 ) = 0 

(lie) 

v'  " (1)  + Qv'  (1)  = 0 

Once  the  variational  statement  has  been  established,  it  is  a routine  matter 
to  apply  the  finite  element  discretization.  In  conjunction  with  adjoint 
variational  statements,  this  procedure  has  also  been  well  documented 
[11,  15,  16,  17]  and  will  not  be  repeated  here. 

4.  STABILITY  BEHAVIOR  DUE  TO  DIRECTIONAL  CONTROL.  Prior  to  the 
presentation  of  numerical  results  and  discussions,  it  would  be  worthwhile 
to  recapitulate  some  basic  definitions  and  concepts.  As  mentioned  in 
Section  2,  the  disturbance  is  assumed  to  be  in  the  form  of 

(10)  u(x,t)  = u(x)eXt. 

The  eigenvalue  X is  a complex  number  in  general.  Thus  we  write 

(11)  X = XR  + i Xj 

where  i = v^T.  Both  XR  and  Xj  are  real  numbers.  When  XR  is  negative  or 
zero,  the  disturbance  u(x,t)  will  decrease  with  time  or  remain  finite,  and 
the  structure  is  thus  considered  stable.  When  XR  is  nonzero  and  positive, 
u(x,t)  will  grow  with  time  and  instability  occurs.  Divergence  instability 
is  characterized  by  Xj  » 0 and  flutter  (or  oscillatory)  instability,  by 
Xj  ^ 0.  In  the  particular  case  when  both  XR  and  Xj  are  zero,  the  solution 
represents  a rigid  body  motion.  According  to  the  definition  of  stability 
described  in  reference  [13],  it  implies  that  small  disturbance  leads  to 
small  deviation  from  the  unidsturbed  state.  In  case  of  small  rigid  body 
motion,  disturbance  and  deviation  are  the  same.  Therefore  the  rigid  body 
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translation  and  rotation  associated  with  the  small  disturbance  must  be 
considered  stable  modes. 

In  a continuous  system,  there  are  infinite  number  of  eigenvalues 
corresponding  to  various  modes  of  vibrations.  Since  a large  Xj  represents 
a high  frequency  mode  which  is  less  likely  to  realize  than  the  lower 
frequency  modes,  only  the  lower  end  of  the  frequency  spectrum  is  of  physical 
importance.  In  a finite  element  discretized  system,  depending  on  the  number 
of  elements  used,  a finite  number  of  eigenvalues  are  obtained  to  approximate 
the  actual  eigenvalues  of  lowest  magnitudes.  For  the  data  presented  in 
this  paper,  nine  (9)  elements  were  used.  From  our  previous  experience 
on  similar  problems,  the  accuracy  of  these  data  should  be  within  about  one 
percent  (1.0%  or  less  compared  with  the  exact  solutions  [11]). 

First  let  us  consider  the  case  without  thrust  directional  control 
(Kg  = 0) . The  numerical  values  of  X of  the  four  lowest  magnitudes  are 
given  in  Table  1 for  various  values  of  Q and  the  curves  of  X vs.  Q are 
shown  in  Figure  3*.  As  shown  in  Table  1,  the  first  branch  of  eigenvalues 
is  zero  throughout  the  range  of  Q.  This  solution  corresponds  to  a rigid 
body  translation  and  is  well  known.  The  second  branch  has  eluded  many 
investigators.  At  Q * 0,  there  is  another  zero  eigenvalue  which  corresponds 
to  a (small)  rigid  body  rotation.  For  Q > 0,  however,  X takes  a (positive) 
real  value.  This  mode  of  disturbance  must  be  a bending  mode  because  it  is 
not  a rigid  body  mode.  Its  magnitude  will  no  longer  remain  small  but  grow 
with  time.  This  unstable  bending  mode  of  disturbed  motion  was  first 
realized  by  the  present  writer  [11] . 

Again,  the  third  and  the  fourth  branches  of  eigenvalues  are  well  known 
[3,  4,  8].  Prior  to  Qpn  = 10.93tt^,  the  eigenvalues  are  pure  imaginary, 
indicating  stable,  oscillatory  motion.  At  Q^g,  these  two  branches  coalesce 
and,  beyond  which,  the  values  of  X become  complex,  indicating  flutter 
instability.  Due  to  the  unstable  bending  mode  of  the  second  branch,  however, 
this  critical  thrust  has  lost  its  significance  and  we  thus  refer  to  it 
as  a pseudo-critical  thrust  (Figure  3) . 

For  a negative  Kg  (Kg  » -0.1  for  example),  the  first  bending  mode  is 
associated  with  a positive  real  X,  as  shown  in  Figure  5,  indicating 
divergence  instability.  For  a given  positive  Kq,  however,  a region  of 
stability  exists  between  Q = 0 and  Q * Qcr>  beyond  which  the  structure 
becomes  unstable  due  to  divergence.  As  reported  in  the  previous  investiga- 
tions [3,  8],  this  critical  thrust  Q^R  was  thought  to  be  a constant  for 
all  values  of  Kg.  This  is  not  conceivable  in  the  light  of  the  present 
analysis.  Since  = 0 at  Kg  - 0 due  to  the  unstable  first  bending  mode, 


*When  X is  either  a purely  real  or  a purely  imaginary  number,  it  appears 
as  a pair  of  the  same  absolute  value  but  with  opposite  signs.  Only  the 
positive  ones  are  shown.  In  Figures  3,  5 and  6,  we  have  plotted  Xj  vs.  Q 
in  the  upper  plane  and  (positive)  Xg  vs.  Q in  the  lower  plane. 
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we  expect  to  vary  (continuously)  from  zero  as  K3  increases.  Again, 
this  is  substantiated  by  our  numerical  results.  It.  Figure  6,  the  detailed 
variations  of  the  eigenvalue  of  the  first  bending  mode  vs.  the  thrust 
parameter  Q is  shown  for  various  values  of  Kq  from  Kq  = 0 up  to  0.5.  The 
trend  of  Q^R  from  zero  as  Kq  increases  is  clearly  observed.  The  variation  of 
Qcr  vs.  Kq  is  again  plotted  in  Figure  7.  This  variation  of  Q^/ir2  is  shown 
to  be  extremely  sensitive  in  the  range  from  0.007  to  0.05.  It  approaches 
rapidly  and  asymptotically  to  the  value  of  2.58  as  Kq  becomes  greater  than 
1.0.  This  curve  was  substituted  by  a constant  value  of  2.60  in  the  previous 
investigations. 

Before  we  conclude  this  section,  let  us  consider  the  special  case  of  a 
uniform  missile  under  a constant  thrust  fixed  in  the  direction  of  the 
undisturbed  axis.  This  problem  was  first  considered  by  Silverberg*  [1] 
and  is  clearly  a subcase  of  the  present  analysis  with  xq  = 1.0  and  Kq  * -1.0 
(i.e.,  0 = -v'(l)).  By  a purely  analytical  approach,  Silverberg  was  able 
to  obtain  the  first  non- zero  Q at  which  the  eigenvalue  vanishes  (Q  = 

2.60tt2).  Thus  he  concluded  that  Q = 2.60tt2  is  the  buckling  load.  From  the 
present  analysis  as  shown  in  Figure  4,  however,  even  though  Q = 2.55tt2  is 
a point  of  vanishing  eigenvalue,  it  cannot  be  considered  a critical  load 
due  to  the  fact  that  the  structure  is  unstable  prior  to  this  load.  This  is 
indicated  by  the  real  positive  eigenvalue  curve  for  Kq  * -1.0  for 
Q < 2.55tt2  which  is  a region  of  divergence  instability.  Thus  we  have  here 

an  example  showing  the  stability  nature  of  a structure  cannot  be  determined 

merely  by  seeking  out  the  loading  parameters  of  vanishing  eigenvalues. 

5.  STABILITY  BEHAVIOR  DUE  TO  A CONCENTRATED  MASS**.  The  effects  on 
the  stability  behavior  of  a uniform  beam  due  to  a concentrated  mass  are 
shown  in  Figures  7 through  11. 

For  a tip  mass  of  two  percent  of  the  total  mass  of  the  beam  (xm  =0, 

m = 0.02),  there  is  a region  of  stability  between  Qj  = 2.7tt2  and  Q~  = 10.6tt  . 

For  Q less  than  Qj,  the  beam  is  unstable  due  to  divergence  of  the  first 
bending  mode.  For  Q greater  than  Q2,  flutter  occurs  due  to  the  coalescence 
of  the  second  and  the  third  bending  modes  (Figure  7).  As  the  tip  mass 
increases  (m  = 0.04  in  Figure  8 and  m = 0.06  in  Figure  9),  the  region  of 
stability  varies  in  such  a manner  that  the  values  of  Qj  and  Q2  decrease 
(Qj  = 1.5tt2  and  Q2  = 8.2ir2  for  m = 0.04;  Qi  = 0.9tt2  and  Q2  = 6.8t r2  for 
m = 0.06). 

It  is  also  observed  that,  in  Figures  8 and  9,  that  flutter  occurs  as 
the  first  and  the  second  bending  mode  coalesce  whereas  in  Figure  7,  it 
occurs  as  the  second  and  the  third  branches  coalesce. 


*Matsumoto  and  Mote  credited  the  solution  to  Beal.  But  Beal  himself 
rightfully  attributed  it  to  Silverberg. 

**TJie  data  reported  here  supersedes  some  of  those  in  Ref.  [11]  as  an  error 
was  found  in  the  computer  program  used  for  previous  calculations. 
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Some  indications  of  the  effect  due  to  the  location  of  the  concentrated 
mass  can  be  seen  from  Figures  9,  10  and  11  in  which  the  stability  curves 
are  calculated  for  a concentrated  mass  m = 0.06  placed  at  locations 
x^  = 0,  xm  = 0.5  and  x_  = 1.0  respectively.  At  locations  xm  = 0.5  and  1.0, 
the  curves  resembles  the  ones  with  no  concentrated  mass  (Figure  3),  i.e., 
there  is  no  region  of  stability  as  shown  in  Figure  10  and  11.  However  at 
Xjjj  = 0,  a region  of  stability  can  be  attained  within  the  operating  thrust 
range  of  Qj  = 0.9tt2  and  Q2  = 6.8it2  (Figure  9). 
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FIGURE  1.  THE  UNDISTURBED  EQUILIBRIUM  OF  A FREE-FREE  BEAM 
UNDER  A CONSTANT  THRUST  AtfD  THE  INERTIA  FORCE 


FIGURE  2.  A SMALL  DISTURBANCE  FROM  THE  UNDISTURBED  AXIS 
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FIGURE  3.  BRANCH  CURVES  OF  THE  FOUR  LOWEST  EIGENVALUES  ---  DISTURBANCE 
OF  A FREE-FREE  BEAM  UNDER  A CONSTANT  THRUST  WITHOUT  CONTROL 
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FIGURE  5.  THE  FIRST  AND  SECOND  NON-ZERO  EIGENVALUE  CURVES  FOR  VARIOUS  DIRECTION 
CONTROL  PARAMETERS  K0 
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FIGURE  9.  BRANCH  CURVES  OF  THE  LOWEST  EIGENVALUES  (xm  =0,  m = 0.06) 
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TABLE  1.  NUMERICAL  VALUES  OF  THE  FOUR  LOWEST  EIGENVALUES  FOR  A UNIFORM  FRE 
FREE  BEAM  UNDER  A CONSTANT  THRUST  WITHOUT  DIRECTIONAL  CONTROL 
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THE  STANDARD  LINEAR  MODEL  IN  THE  STABILITY 
AND  MASS  OPTIMIZATION  OF  NONCONSERVATIVE  EULER  BEAMS 


Charles  R.  Thomas 
Benet  Weapons  Laboratory 
Watervl let  Arsenal 
Watervllet,  New  York  12189 


ABSTRACT.  A dimensionless  stability  problem  together  with  its  adjoint 
problem  has  been  formulated  for  nonconservative,  cantilevered  Euler  beams 
with  linear  external  damping  and  internal  damping  according  to  the 
standard  linear  model.  Mass  optimization  is  considered  for  a beam  with 
a linear  distributed  tangential  load  acting  along  the  centerline.  Graphi- 
cal optimization  plots  are  shown  and  utilized  as  initial  guesses  in  a 
Rosenbrock  optimization  routine  which  indicates  mass  reductions  in  the 
range  of  20%  to  in  excess  of  30%  are  possible. 

1.  INTRODUCTION.  The  stability  problem  was  formulated  for  non- 
conservative,  cantilevered  Euler  beams  with  linear  external  damping  and 
internal  damping  according  to  the  standard  linear  model.  A convenient 
dimensionless  form  of  the  original  equations  was  Introduced  and  an  adjoint 
system  of  equations  was  derived.  These  equations  were  then  utilized  in 
developing  an  adjoint  variational  principle  which  yielded  a characteristic 
equation  for  critical  flutter  load  after  a proper  application  of  a 
generalized  Ritz  procedure.  Specific  numerical  results  are  then  given  for 
Hauger's  problem,  that  is  for  a beam  with  a linear  distributed  tangential 
load  acting  along  with  the  centerline  of  the  beam. 

The  stability  problem  shows  several  most  Interesting  results  with 
the  introduction  of  the  additional  internal  damping  parameter  for  the 
standard  linear  model  as  opposed  to  the  single  internal  damping  parameter 
of  the  Kelvln-Voigt  model.  Several  Interesting  cross-plots  of  flutter 
load  versus  both  Internal  and  external  damping  parameters  show  a 
considerable  variation  of  flutter  load  with  damping  parameters  and  the 
ability  to  determine  a maximum  flutter  load  for  the  case  of  any  one 
damping  parameter  fixed  with  the  ability  of  allowing  the  other  two  damping 
parameters  to  vary  until  the  maximum  is  achieved. 

The  basic  optimization  procedure  was  to  fix  or  choose  a desired  flutter 
load  and  then  numerically  determine  the  optimum  design  which  minimizes  the 
beam  mass.  A special  class  of  generalized  parabola  type  boundary  curves 
with  beam  thickness  being  a function  of  axial  displacement  was  applied  to 
Hauger's  problem  for  a beam  of  rectangular  cross-section.  Important 
constraints  considered  are  that  the  Euler  Beam  Theory  Is  only  valid  for  a 
thickness  to  length  ratio  smaller  than  1/10  th  and  that  practically  the 
free  end  of  the  beam  must  be  of  finite  thickness.  Graphical  results  showed 
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that  considerable  weight  reductions  were  possible  and  yielded  excellent 
starting  values  for  the  optimization  procedure.  A Rosenbrock  optimization 
routine  with  a minimum  tip  thickness  constraint  imposed  was  then  applied 
to  several  beams  with  different  values  of  internal  and  external  damping 
parameters  with  mass  reductions  in  the  range  of  20%  to  In  excess  of  30% 
resulting. 


2.  THE  EQUATION  OF  MOTION.  The  equation  of  motion  for  the  vibration 
of  Euler  beams  with  both  internal  and  external  damping  was  derived  in 
reference  [1].  The  purpose  of  this  section  will  be  to  develop  similar 
equations  which  now  Include  stability  terms. 


Following  Brunei! e [2],  displacements  are  assumed  of  the  form 


(D 


v = z i|>(y,t) 
w = w0(y)  + w(y,t), 


the  strain-displacement  relations  are 


(2) 


-yz 


- 1 r^w  . 

- 2 [37  + 


and  the  stress-strain  law  becomes 

/o\  r r , 

<3)  ' E z * ‘ ‘ E 2 ^ 

if  one  sets  £yZ  = 0 to  allow  for  the  equation 

(4)  ip  = - 3w/8y. 

Thus,  the  averaged  moment  equation  becomes 

2 

(5)  M - - / z av  dA*  = E I ~ 

A*  dyd 

and  consequently  from  (3)  stress  may  be  expressed  as 
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where  I Is  the  second  moment  of  Inertia  about  the  neutral  axis. 


Taking  a deformed  beam  element  with  sides  perpendicular  to  the 
deflected  centerline  [3],  Figure  1 , a sum  of  vertical  forces  and  moments 
yields  the  equations 

2— 

(7)  q - mw  - Fq!  - Fpg  + + Ny  JMSr  = 0 

(8)  Q = 


where  Q Is  the  shear  resultant,  Ny  Is  the  in-plane  load,  q Is  a distributed 
beam  load,  m Is  the  beam  mass,  ana  the  Fn^,  1 = 1,2,  are  external  dampings. 
From  Baker,  Wool  am,  and  Yound  [1]  It  Is  known  that 


(9) 

FD1  - 1 

(10) 

C2 

fd2  = r 

|3wi  3w 
3t  at 


9w 

at 


with  the  details  of  C-j  and  C2  being  amply  discussed  in  that  reference. 
Applying  equations  (9)  and  (10)  to  equation  (7), 
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Now,  consider  internal  viscous  damping  according  to  the  standard 
linear  model 

(12)  (1  + C 3/3t)ay  = (E  + E*  3/3t)ey 

where  E Is  Young's  modulus  and  C and  E*  are  viscoelastic  material  constants. 
Combining  equations  (2)  and  (4)  results  in 

(13)  ey  - - z 2% 

3y 

and  a direct  substitution  of  this  equation  into  equation  (12)  yields 

(14)  (1  ♦ C f^0y  * ' *(E  ♦ E*  fj>  ^ • 
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Multiplying  (14)  by  z and  integrating  over  the  cross-sectional  beam  area 
results  in 


(15)  (1  + c|f)M  ■ (E  + E*ij)I  0 . 

Now,  assuming  that  C Is  Independent  of  y,  taking  a partial  derivative  with 
respect  to  y of  equation  (15)  results  in 

(16>  (1  + c f - C(E  + E* &1 

and  multiplying  equation  (8)  by  (1  + C 3()/9t)  yields 


(17)  (l  ♦ c kjq  > - (1  ♦ c-  |j)  g . 

Hence, 

(18)  (1  + C |r)Q  = - 2-  [(E+E*  3 )l  ^ 

at  gy  9t  gy2 

and  a first  derivative  of  this  equation  is 

(,9)  + c !r>  !£  ■ - -4 [(E  + E*  It*1  ^ 

°Z  dyd 

Multiplying  equation  (11)  by  (1  + C 3()/9t) 

<1+C  |j)q  - 0«  fej.  * - <HC  y J-  l&l  “ 

(20>  ’ (1+c  It)  r “ + (,tc  It)  I? + (,+c  ft)Ny  § " 0 

and  a direct  substitution  of  equation  (19)  Into  equation  (20)  results  In 
the  non-linear  displacement  equation  of  motion 
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(21) 


•i—  [(E+E*  i_) I i-J]  - (1+c  4_)nv  <q* 
av2  at  av2J  n'yZJ 


- (i+c  i_)q  + (i+c  i_)  1^1— 

at  at  i 'at'at 

c 

+ (1+c  1_)  -2  9w  + (i+c  JL)m  w = 0. 
at  a at  at 

Equation  (21)  may  be  linearized  by  setting 

(22>  if?  1 1?  ■ 0 

and  hence  the  linear  form  of  the  equation  is 

[(E+E*  f-)I  ^|]  - (1+C  )Ny  £* 


(23)  - (1+C  ft),t<1+c|t)ri?+  (1+C  ft)"®’0- 

In  the  case  of  elastic  stability,  a compressive  load  is  applied  and 
the  substitution 

(24)  Ny  - - P 

is  made.  Also,  since  the  material  parameter  C has  been  assumed  independent 
of  y it  Is  now  also  assumed  for  consistency  that  the  elastic  and  material 
parameters  E and  E*  are  also  independent  of  y. 

The  corresponding  boundary  conditions  for  conservative  flutter  loads 


[E  |—  (I  !?£)  + E*  !_  (I  -iJL)  - Nv  ^ 

3y  3y  ay2at  y 3y 


- C Nv  =0  or  w=0  on  x=0,£ 

y 3y9t 
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[EI^+E*I  -^H-O  or  3v  ” 0 
A + oy 


sy 


3yc3t 


(26) 


on  y=0,Jt. 


Special  consideration  is  necessary  for  non-conservative  flutter  loads; 
for  example,  the  boundary  conditions  for  non-conservative  flutter  loads 
of  cantilever  Euler  beams  of  length  a are  for  the  clamped  end 


(27)  w = 0,  ^ = 0 at  y = 0 

3y 


and  for  the  free  end 


<E+E*  !t>  If  (I  ■ «• 


(28)  (E+E*  i_)I  ^4  = 0 at  y 

3t 


= i. 


3.  THE  ADJOINT  VARIATIONAL  PRINCIPLE.  The  adjoint  to  equation  (23) 
will  now  be  determined.  Equation  (23)  in  operator  notation  with  w0  - 0 is 


(29) 


LM  = - q - C fa 


where 


L[w]  ■ - E [I  &}  - E*  lL  [I 


3y 


3 f 


3y 


3y23t 


(30) 


+ N A + C N — 

”v  2 L ’y  ? 


3y 


3y  3t 


_2 

l 3t 


C C, 


p 2 -|  3 W » S w 

- C“T  + m]  — r - C m — - 


3t* 


Now,  the  adjoint  equation 
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(31) 


L*[v]  ■ - q - C If 


Is  sought. 

In  general,  the  Lagrange  Indentity 

(32)  v L[w]  - wL*[v]  = P(w ,v), 


where  P(w,v)  Is  the  bilinear  concomitant,  can  be  Integrated  to  yield  the 
Green's  formula 

a a 

(33)  / (v  L[w]  - w L*[v])dy  = P(w,v)|  . 

0 0 

Notice  from  equations  (29)  and  (31)  that  their  right  sides  would  cancel 
In  applying  equation  (33),  hence  one  may  simply  begin  with  the  homogeneous 
form  of  operator  (30), 

(34)  L[w]  = 0. 

A formal  solution  for  the  adjoint  to  operator  (3)  Is  obtained  by  starting 
with  the  equation 

t*i  £ 

(35)  / / v L[w]dy  dt  = 0 

t n 0 


and  manipulating  by  means  of  integration  by  parts  towards  the  Green's 
formula  (33).  A direct  substitution  of  operator  (30)  Into  equation  (35) 
results  In  the  equation 


ti  i 

I Jv| 
t0  0 


3 rr  3 Wl  r+  32 


EiLT[I  2 


] - E* 


3y‘ 


[I 


aw  i 

3y^3t 


dy  dt  = 0; 


(36) 


+ N, 


3y2 


3y2at 


-i  3W 

at 


- [- 


c c. 


+ m] 


a2w 

at2 


C m 


ajw 

St3 
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omitting  the  details  of  numerous  Integrations  by  parts,  the  eventual 
result  Is 


-E  [I  - E*  ^ [I  -V 
9y^  9y^  9y^  9y^9t 

/V  V I? 


y 9yZ9t  ^ 


C ^2  9^w 

- C"T^  + ^ 9 - c m H 

* 9t2  9t3 


E^i  [I^2] " E*^r  [I^r"] 

9y^  9y  9y^  9y49t 


2 3 

- 2—  (N„V)  + C (v  N ) 

9y^  J 9yz9t  J 

C2  9v  /C  C2  , 92v  93v 

T«,,-t,)„2-C"s(3 


-[E  f (I  2^)v]  - E (I  4)  3v 

L 9v  *>..2  ^2  sv 


+ E(i  9_V)  M _ E 3 (i  9-|)  w 

9yZ  °y  °y  8yZ 
. c*  9 / T 93w  , _*  ,T  93W  , 1 V 


\2U  «2. 


2 

9_Vx  9w 


+ E*  /T  o.,  Bw  _ r*  9 (I  9__V\ 
E 9y2>  W E 97  (I  3/ 


3W  x 3 /„  \ 

v N rr  + t—  (Nu  v)  w 
y 9t  9y  y 


CvN  + C |-  (v  Ny)  |5> 

y 9y9t  3y  y 3t 
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A 

+ / 


E*  « ^ (I  - cli  (v  Ny)  w 

9y  3yZ3t  3y2  y 

+ r <v  w> + (ir + m> v If 

c c 

- (“T"^  + m)~-w  + Cmv  ^ 


3t 


r 3v  9w  ^2 

-Cm?I  iT+c,"fi« 

- 9t 


dv 


dy; 


SeetohLrtv?abIe?hU’sJner  ^ ^ ‘ Were  asS«d  ^dependent  of 
Now,  without  loss  of  generality  the  usual  restrictions  that 


(38)  w = §£  = v = 0 at  t = t0,  t1 

are  Imposed  on  equation  (37)  with  the  result  being  that 

*1 


(39) 


i 


A 

/ 


a2  A a2 
E*  w 2 ^ 2 ^ “ C — 2 Mw 

9y  3y  3t  3yZ  y 


C C2 


+ f <v  «)  + (T^+m)  v ft 

CC^  3v_  3^w 

■ ( £ + m)  3 w + C m v —? 

3tz 

r 3v  3w  „ 32v 

L'C^n  + c^« 


dy=0. 


the  resu,t  with  “ 
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L*[v]  = E i-r  [I  - E*  i,  [I  -V-] 
3y  3y^  3y^  3y  3t 


(40) 


2 3 

3 (Ny  v)  + C -4—  (v  Nv) 
3yz 


3y23t 


— — + + m)^.Cin4 


Jl  3t  & gt 

and  the  bilinear  concomitant  being 


3t- 


P(w,v)|  = 

0 


(41) 


(I^)v  - E 


32w  3v 


3y  gy  2 

+ E I 


3y 


2 sy 


^ f*  - E i-  (I  a!»)  w 


3y2  3y  3y  9y2' 


+ E*  I-  (I  -4s-) v - E*I 


8y 


3y23t 


3y23t  8* 


+ E*i  E*  1.  (i  3W 

gy2  3y3t  3y  U gy2^  3t 


- v Nu  S + — (Ny  v)w 


7 3t  ay 


_ r v n 3,  W^_  + r 3 

„ L v Ny  ayat  L 


3y  (V  V 


3w 

at 


o 


The  original  and  adjoint  boundary  conditions  are  now  determined  by 
setting  the  bilinear  concomitant,  equation  (41),  equal  to  zero.  The 
problem  here  Is  In  deciding  which  terms  to  group  out  as  the  original 
boundary  conditions  and  consequently  which  terms  will  group  out  as  the 
adjoint  boundary  conditions  --  a simple  Inspection  of  equation  (41) 
Indicates  that  several  choices  are  possible. 

In  the  case  of  the  non-conservative  flutter  loads  of  cantilever  Euler 
beams,  the  original  boundary  conditions  were  postulated  in  the  form  of 
equations  (27)  and  (28).  Based  upon  equation  (28),  the  expressions 
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(42) 


(I  2")  + E*i-  (I 
3y*-  °y 


J l 

-V-)]v|  =0 
ayzat  o 


[E  I + E*I 

9yz 


a3w 

ay2at 


.]  3v. 
J ay 


I • 0 


may  be  directly  Isolated  from  the  bilinear  concomitant  (41)  with  a 
corresponding  reduction  to 


l 

P(w,v)|  = 
0 


(43) 


E 1 |^  - E |_  ( I A)w 

3y2  W 3y  3y2 


__  i 


+ E*I 


a v a w E*  a_  / t a_v  \ aw 
2 ayat  " ay  1 9y2;  at 


ay 


v Ny  It  + ly  (Ny  v)w 


- C V 


2 

acw 


V ayat 


+ C (V  Ny) 


aw 

at 


Now,  equation  (27)  dictates  that  the  remaining  terms  in  (43)  group  out  as 
products  of  either  w or  3w/3y;  if  this  is  to  occur,  the  terms  with  time 
derivatives  of  w must  be  put  into  a different  form.  To  accomplish  this, 
equation  (43)  is  Integrated  with  respect  to  time  to  yield 


Tt  32v  3w  f 3_  /t  a2v 
2 ay  ay  u *„2 


t*|  % t* 

/ P(w,v)|  dt =/ 
tn  0 t_ 


(44) 


)w 


ay£ 


dyc 


92u  ^2  2 

+ E*I  — -w-  . r*  3 fT  9 v\  3w 

ay2  3yat  a^1  It 


v Ny  It  + ly  (Ny  v)w 


c v Ny  r?r  + c I-  (v  Ny> 
y ayat  ay  y 


aw 

at 


dt 


and  those  terms  with  time  derivatives  of  w are  integrated  by  parts  with 
equation  (38)  appropriately  applied  to  result  in 
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(45) 


ti  % 

j P(w,v)|  dt  = 
to  0 


= / 


[EI 


2 

3 V 


3y 


a 

9y29t 


2 - ' v Vc  My  ft]  57 


+ f - E |-(I  ^-)+E*  |_(I  J»)*  |— (NyV)-C  * (v  Ny) 

L 9y2  9y  9y23t  ay  y 9t9y  y 

From  equation  (45),  It  is  clear  that  one  can  set 


w 


(46) 


92v 

[EI  - E*I  — w — 
9y^  9y*9t 

2 


3 & 

9v  - vN  + CN  9vl— I =0 

v "y  u "y  5tJ  3y'0 


[-E  f (I  ^)+E*  » (I  «L(Nyv)-C  JL(v  Ny)]w|^  = 0 

3y  3yz  9y  3yZ9t  9y  y at9y  ^ g 


and  as  a result  of  this  the  bilinear  concomitant  Is  zero 


dt. 


(47)  P(w, v) | =0 

0 

such  that  the  formal  adjoint  (40)  may  now  properly  just  be  termed  the 
adjoint.  Since  equations  (27)  and  (28)  are  the  original  boundary  conditions, 
their  satisfaction  In  equations  (42)  and  (46)  leaves  the  equations 


(48) 

and 


v = 0,  f“=0  on  y = 0 


[EI  - E*I  -^2-  - * N + C Ny  |f]  * 0 
jZ  y * dt 


9y 


9yt9t 


[+  E I-  (I  U)  - E*  i_  (I  J>)  - (N„v)  + 


3y  “ ay^dt'  3y  ' y 


(49) 


+ £-L  (v  Ny]  - o 


3t3y 


on  y = i 


for  the  adjoint  boundary  conditions. 
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4.  THE  DIMENSIONLESS  PROBLEM.  At  this  point  the  original  and  adjoint 
differential  equations  together  with  their  respective  boundary  conditions 
are  known.  Before  proceeding  on  to  the  adjoint  variational  principle, 
it  would  be  advantageous  to  develop  a dimensionless  formulation  of  the 
problem  at  hand  and  to  discuss  possible  types  of  flutter  load. 

To  begin  with,  the  dimensionless  variables 

x = y/l 

(50) 

T = t/O 


are  taken  and  it  is  assumed  that  the  beam  area  and  moment  of  Inertia  may 
take  the  form 

I(x)  = I0I(x) 

(51)  n, 

A(x)  = AqA(x). 

Also,  the  compressive  load  N = - P,  equation  (24),  is  introduced  Into  the 
equations  at  this  point.  For  general  convenience,  the  following  dimension 
less  variables  are  applied  throughout  the  development 

a(x)  = I (x) 


y = E*/Ea 
p(x)  = P(xH2/EI0 

(52) 

i = C/a 

q"  = C2£3/EI0a 

% 

£5(x)  = p AqV  A(x)/EIqP2. 

Thus,  a direct  application  of  equations  (50-52)  to  equations  (27),  (28), 
(29),  and  (30)  together  with  a bit  of  manipulation  and  the  assumption  that 

(53)  q = 0 

results  in  the  dimensionless  original  differential  equation 


£,  [«(*)  + T [o(x)  + p(x)  £& 

9x^  3x^  3x^  3x^3t  3x* 

(54)  3 2 3 

+ £p(x)  + q ~ + C5q+B(x)]  —•  + 5B(x)  = 0 

3x^3t  dT  3t^  3tj 

together  with  the  non-conservative  dimensionless  original  boundary  conditions 
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3w 

(55) 

£ 

It 

O 

s* 

1 

= 0 

on  x=0 

3x 

and 

3 

32w 

3 dW 

[1  + Y — ]a(x)  ~z  = 0 

3t  3x^ 


(56)  ? on  x=l 

3 3 3 w 

[1  + Y — ■]  — [a(x)  -=■]  = 0 
3r  3x  3x£ 

for  a cantilevered  Euler  beam.  Similarly,  equations  (50-53)  applied  to 
equations  (31),  (40),  (48),  and  (49)  result  in  the  dimensionless  adjoint 
differential  equation 


32v 


33v 


(57) 


— o [a(x)  — =■]  - Y — p 0(x)  — =— ] + — r [p(x)v] 

3x^  3x2  3x^  3x23t  3x2 


33v 


33  _ 3v  3 2v 

- 5 — — [p(x)v]  - q — + [?q  + 6(x)]  — - 56(x) 

3t  3t^  St'3 


3x23t 


= 0 


and  the  dimensionless  adjoint  boundary  conditions 


(58) 

and 


3v 

v = 0,  — = 0 

3x 


on  x=0 


3 2v 


33v 


3v 


a(x)  — j - Ya(x)  — « — + p(x) v - Cp(x)  — =0 
3x  3x^3t  3t 


3^v 


33v 


[a(x)  — o]  - Y — [a(x)  — ? 
dx  3x‘  3x  3x^3t  3x 


■]  + — [p(x)v] 


(59) 


- 5 


3x3x 


[p(x)v]  = o 


on  x=l 
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Some  examples  of  the  loading  P(x)  for  several  distinct  flutter  problems 
are 

(60)  P(x)  = P 

for  Beck's  problem  [4]  which  assumes  a concentrated  load  is  applied  tangent 
to  the  free  end, 

(61)  P(x)  = q(l-x) 

for  the  Leipholz  problem  [5]  which  assumes  that  a uniformly  distributed  load 
acts  tangentially  along  the  centerline  of  the  beam,  and 

(62)  P(x)  = ^q0(l-x)2 

for  Hauger's  problem  [6]  which  assumes  that  a linear  distributed  tangential 
load  acts  along  the  centerline  of  the  beam.  These  are  merely  examples  of 
what  can  be  handled,  the  analysis  can  be  applied  to  any  problem  which  satis- 
fies equation  (54),  with  the  P(x)  now  being  specified  for  this  problem,  and 
the  corresponding  boundary  conditions  (55)  and  (56). 


5.  THE  ADJOINT  VARIATIONAL  FORMULATION.  Based  upon  a suggestion  by 
Dr.  Gary  Anderson,  considerable  simplification  in  developing  the  varia- 
tional principle  is  possible  if  solutions  of  the  form 

w(x,t)  = W(x)e^T 

(63)  ' XT 

v(x,t)  = V(x)e  *T 

are  immediately  assumed.  A direct  application  of  solution  (63)  to  the 
original  problem  (54-56)  results  in  the  differential  equation 

92  92W  a2w 

[a(x)[l+Ay]  -— =■]  + [1+X5]p(x)  — , 

9x^  9x2  9x^ 

(64)  + [Aq(l+A£)  + A2B(x)  [1+A£]]W  = 0 
and  the  boundary  conditions 

9W 

(65)  W = 0,  — =0,  on  x=0 

9x 

and 
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(66) 


92W 

(l+yX)a(x)  — r = 0 
9x^ 

3 92W 

(1+yX)  — [a(x)  — 7\ 
9x  9x‘ 


= 0 


on  x=l 


Likewise,  applying  solution  (63)  to  the  adjoint  problem  (57-59)  results  in 
the  differential  equation 


9' 


9 2V 


32  * 


(67) 


—7  [0+YX)a(x)  —A  + “7  [(l+X£)p(x)V] 
Sx^  9x^  9x^ 

+ [Xq(l+XC)  + X23(x)  [1HX]]V  = 0 


and  the  boundary  conditions 


(68) 

and 


V = 0, 


3V 

— = 0 
9x 


on  x=0 


92V 

(l+yX)a(x)  — r- + (l+X£)p(x)V  = 0 
9x2 


(69) 


3x 


9^V  3 

C(l+Xy)a(x)  — r]  + — [(l+A5)p(x)V]  = 0 on  x=l 
9x^  3x 


Now,  a variational  principle  based  upon  equations  (64-69)  will  be 
developed  in  terms  of  potential  energy  V*  such  that  the  ultimate  result 
is 


(70) 


5V*  = 0. 


To  begin  with,  equation  (64)  is  multiplied  by  6V  and  integrated  over  x 
1 


J 

0 


(71) 


92  92W 

— - [(l+Xy)o(x)  — ]5V 


9x‘ 


92W 


9x* 


+ [1+X£]p(x)  — sv 


3X 


+ [(l+X?)Xq  + (l+X5)X26(x)]W6V 


dx=0. 
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After  an  application  of  integration  by  parts  and  a bit  of  manipulation, 
equation  (71)  may  be  put  into  the  form 


1 


« / 
0 


92W  92V 
[1+Ay)a(x)  — p — p 

9W  9V  9p(x)  9W 

- [p(x) + — V]  (1+XC) 

9x  9x  9x  9x 

+ [(1+XOXq  + (1+AOA2B(x)]WV 


dx 


3 92W  9W 

( — [(1+X§)a(x)  — r]  + O+K)  ~ p(x))6V 


1 


9x 


3x‘ 


9x 


9^V  9 


+ (—  [(1+Xy)a(x)  — d + — C(l+X5)p(x)V])6W 


9x 


3x  9x 


(72) 


92W 

- [(1+Xy)a(x)  -r] 

3x2 

92V 

- [(1+Xy)a(x)  — ] 

9x^ 


9W 

«(— ) 

3x 

9V 

«(— ) 

3x 


= 0. 


In  looking  at  equation  (72),  it  is  clear  from  equations  (65-66)  and  (68-69) 
that  the  following  boundary  conditions  are  immediately  satisfied 

9V  9W 

V = w = — = — = 0 on  x=0 

3x  9x 


(73) 


92W 

(l+Xy)ot(x)  — j = 0 
9x^  _ 

9 32V 

— [(1+Ay)a(x)  — -]  + 
9x  3x^ 


on 


3 

9x 


C(l+xc)p(x)v]-o 


x=l 


on  x=l 


9 92W 

(1+Ay)  — [a(x)  — -]  = 0 on  x=l 

9x  9x^ 
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and  that  the  boundary  condition 

92V 

(74)  (l+yX)a(x)  — T + (l+X?)p(x)V  =0  on  x=l 

9x2 

must  still  be  satisfied.  If  it  is  now  stipulated  that  boundary  condition 
(74)  must  also  be  satisfied,  equation  (72)  can  be  put  into  the  form 


aw 

(75)  + <5[(l+XC)p(x)  — V]  =0. 

3x  x=l 

To  arrive  at  the  variational  principle  (70),  it  is  now  assumed  that  the 
potential  energy  may  be  broken  up  as 

(76)  V*  = V*  + V*, 
where 

(77)  V*  = I / V,  dx 

1 2 o 1 


(78)  V*  = J V«  6 (x-1 )dx 
L 0 

and  equation  (78)  reflects  the  introduction  of  the  Dirac  Delta  function 
6(x-a  ) with 


(79)  / 4>(x)  <5(x-a  )dx  = <f>(a  )/2 

0 

being  one  of  its  most  useful  integral  properties.  Thus,  applying  equations 
(76-78)  to  a comparison  of  equations  (70)  and  (75)  results  in  the  potentials 
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(80) 


V1 


[(1 +Xy)a(x) 


92W  92V 
9x2  9x2 


9W  9 

- (1+X5)  — [p(x)V] 

9x  9x 

+ [O+XOXq  + (1+XOX23(x)]WV 


9W 

(81)  V2  = (l+XOp(x)  — V. 

C 9x 

The  calculus  of  variations  will  now  be  used  to  determine  the  Euler- 
Lagrange  equations  associated  with  equation  (70).  Clearly,  the  functional 
dependencies  of  V-)  and  V2  are 


(82) 


9V  9W  92V  92W 

Vi  = V,  (x  : V,  W,  — , — , — ) 

9x  9x  9x2  9x2 


V2  = V2  (x  : V,  9W/9x) 

and  it  may  be  postulated  that 

1 9V  9W  92V  92W 

I(V,W)  = f V,  (x  : V,  W,  — , — , — , — y)dx 

0 9x  9x  9x^  9x^ 

1 9W 

(83)  +/  V2(x  : V,  — ) 6 (x-1  )dx 

0 9x 

such  that 


(84)  61  = 0. 


A back  substitution  of  equation  (84)  into  equation  (83),  integration  by  parts, 
and  an  application  of  the  first  lemma  of  the  calculus  of  variations  leads  to 
the  Euler-Lagrange  equations 


(85) 


92  9V,  9 9Vi  9V, 

(_L)  - _ ( — L)  + JL  = 0 

9x2  9VXX  9x  9VX  9V 

92  9V,  9 9V,  9Vi 

-T(— ) --(—)'+—■  o 

9x^  9WXX  9x  9WX  9W 
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and  the  boundary  conditions 


(86) 


9 9Vi  9Vi 

- — ( -)  + = 0 

9x  9VXX  9VX 


9 9Vn  9V-.  9V? 

- — ( -)  + — + — = 0 

9x  9 Vxx  9VX  9V 


9 9Vi  9Vi 

- ( -)  + — = 

9x  9WXX  9WX 


9V-| 


0 


9V]  3V-, 

+ = 0 

9WXX  SWX 


0 


or  V=0  on  x=0 


or  V=0  on  x=l 


or  Vx=0  on  x=Q,l 
or  W=0  on  x=0,l 


or  Wx-0  on  x=0 
or  Wx=0  on  x=l . 


A direct  substitution  of  equations  (80-81)  into  equations  (85-86)  results  in 
an  exact  reformulation  of  the  original  problem  (64-66)  as  well  as  the  adjoint 
problem  (67-69).  This  verifies  that  the  variational  principle  (70)  is  indeed 
correct  when  V*  is  defined  by  equations  (76-78)  and  (80-81). 


6.  THE  GENERALIZED  RITZ  APPROXIMATION.  Following  Anderson  [7],  a natural 
extension  of  the  Ritz  method  in  its  classical  form  is  applied  to  the  current 
adjoint  boundary  value  problem.  The  approximation  is  made  in  terms  of  a finite 
series  expansion  of  the  product  of  prespecified  coordinate  functions.  The 
approximation  will  now  be  made  as  the  two  series 

W(x)  « l a W (x) 
n=l  n n 

N 

(87)  V (x)  - l bn  V (x) 

n=l 

where  Wn(x)  and  Vn(x)  are  the  coordinate  functions  which  must  be  prescribed  and 
an  and  bn  are  constants.  Thus,  the  general  procedure  involved  will  be  to  select 
Wn(x)  and  Vn(x)  subject  to  certain  boundary  constraints  and  then  to  use  the 
variational  principle  to  find  a characteristic  equation  for  flutter  load.  In 
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general,  it  is  believed  that  satisfaction  of  all  or  most  of  the  boundary 
constraints  tends  to  minimize  convergence  time  [7].  However,  if  those 
terms  in  the  natural  boundary  conditions  which  contain  derivatives  of 
odd  order  of  time  [7]  are  deleted,  then  the  coordinate  functions  which 
were  previously  developed  for  the  undamped  beam  may  be  employed  directly 
in  the  current  problem.  Since  the  geometric  boundary  conditions  remain 
unchanged,  no  extra  consideration  is  warranted  in  this  case.  Thus,  going 
back  to  natural  boundary  conditions  (56)  and  (59)  with  the  assumption 
that  odd  order  derivatives  in  time  are  deleted  and  tracing  through 
ensuing  developments  results  in  the  boundary  conditions 


(88) 


and 


(89) 


3^w 

a(x)  — = 0 
3x^ 

on  x=l 

3 3^w 

— [a(x)  — -]  = 0 

3x  3x^ 


a(x)  — « + p(x)v  = 0 
3x^ 

— Cct(x)  + — [p(x)v] 
3x  3x^  3x 


on  x=l 


0 


which  are  now  utilized  in  place  of  boundary  conditions  (66)  and  (69). 

Following  reference  [8],  it  is  the  current  intention  to  choose  the 
Wn(y)  and  Vn(y)  as  general  polynomials.  The  polynomials  will  be  chosen 

such  that  there  is  one  term  for  each  boundary  condition  plus  one,  the 
highest  power  is  one  more  than  the  highest  order  derivative,  and  the 
n the  order  polynomial  is  of  the  form 


(90)  AD  (a-i  + + a5  x4). 

Hence,  the  assumed  forms  of  the  coordinate  functions  are 

(91)  Wn(x)  = x^n"^(a]  + a2X  + agx^  + a^x^  + a^x4) 

(92)  Vn(x)  = x^n~^(bi  + ^x  + b3x^  + b4x^  + b5x^)- 
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Now,  the  constants  in  coordinate  functions  (91)  and  (92)  are  evaluated 
by  substitution  of  these  equations  into  boundary  constraints  (65),  (88)  and 
(68),  (89),  respectively,  which  they  must  identically  satisfy.  Thus,  a 
direct  substitution  of  equation  (91)  into  the  boundary  conditions  results 
in  the  four  constants 

a-|  = 0,  a2  = 0 

(93)  a3  = C(n+3)(n+2)2(n+l)a(l)]a5/d 

a4  = [2(n+3)(n+2)(n+l )na(l )]a5/d 

where  the  denominator  term  d is  defined  as 

(94)  d = (n+2)(n+l)2na(l). 

Since  the  a^  in  equations  (93)  is  really  just  an  arbitrary  constant  if  traced 
through  to  its  origin  in  equation  (91),  substantial  simplification  is  realized 
by  choosing  it  as 

(95)  a5  = d/[(n+2)(n+3)a(l )]. 

A direct  back-substitution  of  equations  (93-95)  into  equation  (91)  results  in 
the  final  form  of  the  coordinate  function  being 

(96)  Wn(x)  = x(n+l)[(n+3)(n+2)  - 2(n+3)n  x + (n+l)nx2]. 

Similarly,  a direct  substitution  of  equation  (92)  into  the  boundary  conditions 
results  in  the  four  constants 

b-j  = 0,  b2  = 0 

(n+3) (n+2) (n+1 )a2(l ) + 2n(n+2)a(l )p(l ) 

3 2(n+2)a(l ) p ' (1 ) + 2(n+2)a'  (1 )p(l ) + P2(l)_ 

-2(n+3)(n+2)(n+l )n  a2(l)  - 4n(n+1  )a(l  )p(l ) 

b4  = 

L+2(2n+3)a(l)p'(l)-2(2n+3)a  (1 )p(l )-2p2(l ) 

where  the  denominator  term  d is  defined  as 

= (n+2)  (n+1  )2n  a2(l)  + 2 ( n+1 ) (n-1  )a(l  )p(l ) 

(98)  - 2 (n+1 )a(l )p‘ (1 ) + 2(n+l )a‘ (1 )p(l ) + p2(l ) 

and  the  prime  notation  indicates  derivatives  with  respect  to  coordinate  x, 
i.e.  1 = d/dx. 
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Now,  equations  (97)  reduce  equation  (92)  to 

(99)  Vn(x)  = x(n+1)[b3  + b4x  + bgx2] 
which  upon  letting  bg  = cf  results  in 

(100)  Vn(x)  = x(n+1^[an  - 2Bnx  + y^2] 
where 

an  = (n+3) (n+2)2(n+l )a2(l ) + 2n(n+2)a(l )p(l ) - 2(n+2)a(l )p' (1 ) 

(101)  + 2(n+2)a'(l)p(l)  + p2(i) 

Sn  = (n+3)(n+2)(n+l )na2(l ) + 2n(n+l )a(l )p(l ) - (2n+3)a(l )p' (1 ) 

(102)  + (2n+3)a'(l)p(l)  + p2(l) 

yn  = (n+2)(n+l )2n  a2(l)  + 2(n+l )(n-l )a(l )p(l ) - 2(n+l )a(l )p' (1 ) 

(103)  + 2(n+l  )a' (1 )p(l ) + p2(l). 

With  the  coordinate  functions  W (x)  and  V_(x)  now  determined,  it  remains 
to  solve  for  the  flutter  load  p (x ) . "The  approximate  solutions  (87)  are  now 
back  substituted  into  the  potential  energy  expression  given  by  equations  (76- 
81 ) to  yield 


V * — 


k-K  n=N  1 


(1+Xy)  l l G . / a(x)wA ' (x)v' '(x)dx 
k=l  n=l  nk  0 K n 


k=K  n=N  1 

-(1+X£)  ^ Gpk  J p(x)Wk(x)Vn(x)dx 
k=K  n=N  1 

■O+AC)  l l G k / p'(x)Wk(x)V  (x)dx 
k=l  n=l  nK  0 K 

_ k=K  n=N  1 

+(l+X?)Xq  l [ Gk|  Wk(x)V  (x)dx 

k=l  n=l  nK  0 k 

9 k=K  n=N  1 

+0+XC)X  l l Gnk  / 8(x)Wk(x)Vn(x)dx 

k=l  n=l  0 
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1 k=K  n=N  , 

+ 2 (1+’'°  k=l  n-l  GnkP<X>Wk<x)Vn(x)l(=i 


004) 


where 

(105)  Gnk  = ak  bn. 

Now,  a consideration  of  equation  (104)  shows  that  the  following  definitions 
are  possible 

1 

(106)  Ank  = J B(x)Wk(x)Vn(x)dx 


(!°7)  Bnk  = J a(x)Wk‘(x)V|!)'(x)dx 


1 

(1°8)  Cnk  = / p(x)Wk(x)Vn(x)dx 

(109)  Dnk  = / p'(x)Wk(x)Vn(x)dx 

1 

(110)  Enk  = J Wk(x)Vn(x)dx 


(HI)  F k • p(x)w‘(x)Vn(x)| 


nk  „ - ,x=1 

and  hence  equation  (104)  may  be  rewritten  as 


V*  = 1 
2 


k=K  n=N  k=K  n=N 

(,+Xy)  ,1  I *nkBnk-f+^>  X j.  GnkCnk 

k- 1 n- 1 k * n I 

k=K  n=N  _ k=K  n=N 

-d+X5)  I I GnkDnk+(1+X5>Xq  . I,  I,  GnkEnk 

k=l  n=l  k=l  n=l 

k=K  n=N 

+(1+XE)A.^  £ I ^nk^nk 
- k=l  n=l 
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(112) 


1 k=K  n=N 

+ “ (l+x^)  l l GnkFnk* 
k=l  n=l 


Thus,  the  variational  principle  (70)  is  now  applied  to  equation  (112)  with 
the  functional  dependence  of  V*  being 

(113)  V*  - V*(ak,  bn) 

and  the  resulting  Euler-Lagrange  equations  being 

3V*  3 V* 

=0,  =0. 

(114)  3ak  3bp 

Applying  equations  (114)  to  equation  (112)  and  recalling  from  equation  (105) 
that  Gnk  = akbp  results  in  the  conditions 


(115) 


(l+Ay)Bnk  - (l+A£)Cnk 
-d+AODnk  + (l+x?)Aq  Enk 
+(l+AE)A2Ank  + (l+A£)Fnk 


(116) 


k=K  T <1+X*>Bnk  - '1+X5>  Cnk 

j -(l+«)Dnk  + (l«5)Xq  E„k 
k=l 

J-(1+AE)A^  Ank  + (1+A£)  Fpk 


Clearly,  equations  (115)  and  (116)  are  equivalent  and  hence  it  is 
permissible  from  this  point  on  to  consider  only  one  of  these  equations. 
Hence,  working  only  with  equation  (115)  it  is  possible  to  obtain  the 
critical  flutter  load  by  setting  the  determinant  of  the  K equations  thus 
represented  equal  to  zero  with  the  resulting  characteristic  equation 
being 

(£  Ank^  + (Ank  + Fnk^ 

DET  +(y  Bnk  - £ Cnk  - £ Dnk  + q Enk  + £ Fnk)A 

(^)  +(&nk  " Gnk  **  Dnk  + Fnk^ 

Now,  equation  (117)  may  be  solved  numerically  on  the  digital  computer  for 
critical  flutter  load  once  the  necessary  material  and  geometric  properties 
are  specified. 
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7.  THE  OPTIMIZATION  OF  HAUGER'S  PROBLEM.  Hauger's  problem  [6]  will 
now  be  optimized  within  the’  bounds  of  a special  class  of  generalized 
parabola  type  boundary  curves.  The  beam  thickness  will  be  expressed 
functionally  in  terms  of  length  axis  displacement,  but  the  beam  width  b 
will  be  held  constant  and  thus  any  cut  perpendicular  to  the  length  axis 
will  have  a rectangular  cross-section. 

A generalized  parabola  type  curve.  Figure  2,  is  passed  through  the 
point  z=a  at  the  clamped  end  y=0  and  the  point  z=c  at  the  free  end  y=£. 
The  generalized  form  of  this  curve  is  chosen  as 

(118)  z - c = (const. ) U-y)n 

and  the  constant  is  now  evaluated  such  that  at  y=0,  z=a  with  the  result 
being 


(a-c) 

(119)  const.  = . 

£n 


Figure  2,  The  Generalized  Parabolic  Type 
Boundary  Curve 
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Representing  the  beam  half- thickness  as  h(y)  such  that  h(y)=z,  then  from 
(118)  and  (119) 

(120)  h(y)  = c + (a-c) 

The  basic  optimization  problem  to  be  considered  is  that  of  choosing  or  fixing 
the  desired  flutter  load,  in  this  case  q0,  and  then  determining  the  optimum 

design  h(y)  which  minimizes  the  beam  mass. 

The  dimensionless  variables  (50)  and  (52)  are  introduced  and  following 
equations  (51)  the  second  moment  of  inertia  is  expressed  as 

(121)  I(x)  = I0S(x) 

and  in  a similar  vain  the  area  can  be  written  as 

(122)  A(x)  = A0  r(x). 

Now,  a suitable  reference  dimension  will  be  introduced  by  using  length 
as  a characteristic  beam  reference  and  the  dimensionless  beam  coordinates 

(123)  a = cM,  y = a/ fl- 
are introduced  and  applied  to  equation  (120)  with  the  result  being 

(124)  h(x)  = fl[a  + (y-a)(l-x)r|]. 

The  basic  definition  of  second  moment  of  inertia  is  evaluated  for  a constant 
beam  width  b and  thickness  2h(x)  to  yield 

(125)  I = / z3  dA  = \ b h3(x) 

A 3 

which  upon  an  introduction  of  equation  (124)  becomes 

r\ 

(126)  I = jbi3[a+(H(l-x)'1]3; 

letting 

(127)  I0  = |b*3 

and  comparing  equation  (126)  with  equation  (121)  results  in  the  definition 
028)  S(x)  = [a  + (y-a)  (1  -x)1"1]3. 
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A similar  look  at  the  cross-sectional  area 

(129)  A(x)  = 2b  h(x) 
with  the  definition 

(130)  A0  = 2 b A 

being  made  results  in  equation  (122)  yielding  the  definition 


(131) 

r(x) 

= [a  + (y- 

-a)(l-x)1]. 

A comparison  of 

equations 

(128)  and  (131)  shows  them  to  be  functionally 

similar 

and  if  1 * 

the  function 

(132) 

f(x) 

= [a  + (y 

"o)(l-x)i] 

Is  now  < 

iefined, 

then 

(133) 

r(x) 

= f(x) 

(134) 

S(x) 

- f3(x). 

The  basic  optimization  problem  will  be  to  minimize  the  beam  mass  while 
holding  the  flutter  load  constant,  which  can  be  expressed  as 

1 

(135)  / pA(x)dx  = minimum. 

0 

Equation  (135)  is  readily  expressed  as 

1 , 1 

(136)  J pA(x)dx  = 2pirb  J f(x)dx 
0 0 

and  since  p,  l , and  b are  essentially  fixed  variables  the  problem  reduces  to 
minimizing  the  integral 

1 

(137)  / f(x)dx  * minimum 
0 

or  hence  minimizing 
1 

(138)  J f (x)dx  ■ a + [(y-a)/(n+l)]. 

0 
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8.  THE  APPROXIMATE  SOLUTION.  A first  approximation  to  the  current 
problem  will  be  made  by  considering  only  the  first  several  terms  of  the 
generalized  Ritz  solution  for  the  original  and  the  adjoint  problems. 

From  equations  (50),  (52),  and  (62)  it  is  clear  that 

(139)  p(x)  = Q(l-x)2 


where 


(140)  Q = 


Also,  from  equation  (50)  and  (52)  and  equations  (51),  (121),  (122), 
(131),  (133),  and  (134)  it  is  clear  that 

a(x)  = S(x)  = f3(x) 

(141) 

6(x)  = 6 r(x)  = 8 f (x ) 


where 

(142) 


e = 


p A-  t4 


E I0  c‘ 


and  the  now  most  convenient  definition 

A 


(143) 


p An  r 


E I, 


results  in 

(144)  6=1. 

From  equation  (139)  it  is  clear  that 

(145)  p(l)  = p'(l)  = 0 


and  hence  the  constants  (101-103)  in  the  general  Ritz  coordinate  functions 
(96)  and  (100)  reduce  to 

an  = ( n+3 ) ( n+2 ) ^ ( n+1 )a^ ( 1 ) 

(146)  Bn  = (n+3) (n+2) (n+1  )n  a2(l) 

yn  = (n+2) (n+1 )2n  a2(l ). 
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For  later  convenience  in  numerical  calculations  it  is  desirable  that  integrals 
( 1 06 ) - ( 1 1 1 ) be  independent  of  both  a and  y and  that  the  exact  loading  (139) 
now  be  introduced,  hence  equations  (106)-(111)  become 

1 „ 1 __ 

Ank  = a/  yWk(x)Vn(x)dx  + u/  CWk(x)Vn(x)dx 

1 1M 
Bnk  = a3  / ^'(x)v;'(x)dx  + 3a2uf  y2!  wk'(x)Vn  (x)dx 

+ 3a  y2/  ?2Yw',(x)v'‘(x)dx  + y3/  ' (x)V ' ' (x)dx 

0 K o k n 

cnk  = Q / O-x)2  v^(x)dx 

1 

Dnk  = -2  Q / (1-x)  Vn(x)  Wk(x)dx 

Enk  = l Vx)  V*>dx 

(147)  Fnk=  0 


where 

Y = 1 - (l-x)n 

(148) 

X = (1-x)^. 


With  the  integrals  (147)  now  known,  it  is  possible  to  now  evaluate  the 
critical  flutter  load  from  characteristic  equation  (117)  which  may  be 
restated  as 

(149)  DET  | ankx3  + bnkX2  * cn|(»  + dnk  | = 0 

where 
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ank 

bnk 


cnk 

(150)  dnk 


5 Ank 

Ank  + ^ Enk 

YBnk  ’ ^nk  - £Dnk  + q Enk  + ?Fnk 
Bnk  ' cnk  " Dnk  + Fnkf 


To  allow  for  the  utilization  of  available  numerical  procedures,  it  is 
convenient  at  this  time  to  reformulate  equation  (149)  into  a standard 
matri x el genval ue  probl em 


(151)  (R  - XI)x  ■ 0. 

Clearly,  the  matrix  problem  leading  to  equation  (149)  is 


(152)  (A  X3  + B X2  + C X + D)x  = 0. 


If  the  two  definitions 


= ll 

(153)  X i = 1 z 

are  introduced  into  equation  (152),  it  is  readily  shown  that  the  relation 


""I  1 “ 

wm 

- **■ 

i 1 1 | * 

o'  $ < i 

_ i — i — 

X 

1. 

= X 

X 

l 

-A_1D  | -A_1c  \ -A_1B 

z 

z_ 

(154) 

results  and  is  exactly  of  the  form  (151). 


9.  CONSTRAINTS  AND  THE  NUMERICAL  PROCEDURE.  First  of  all , two  Important 
constraints  natural ly  appear  in  the  consideration  of  beam  optimization.  The 
Euler  Beam  Theory  itself  Is  only  valid  for  a thickness  to  length  ratio  less 
that  l/10th.  Since  reference  will  momentarilly  be  made  to  comparisons  of  the 
minimum  design  to  a fixed  rectangular  design  with  a*y  and  since  the  load 
intensity  as  given  in  equation  (62)  increases  as  the  clamped  end  is  approached, 
the  clamped  end  thickness  will  be  taken  greater  than  or  at  most  equal  to  the 
free  end  thickness  and  hence  the  constraint 

(155)  y > a. 
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Hence,  since  y is  the  larger  of  y and  a,  the  Euler  Beam  restriction  of  a 
thickness  to  length  ratio  less  than  1/10  may  be  expressed  as 

(156)  y<  — 

20 

as  the  clamped  end  thickness  is  really  2a.  Practically,  one  end  of  the 
beam  must  be  of  finite  thickness  or  thus  y must  be  finite,  this  condition 
together  with  equation  (156)  result  in  the  constraint  on  y that 


(157)  0 < y < — . 

20 


Now,  with  equation  (157)  in  mind,  the  problem  of  avoiding  negative  areas  is 
accomplished  by  setting  the  free  end  thickness  to  being  non-negative  which 
together  with  equation  (155)  results  in  the  constraint  on  a being 

(158)  0 £ a 5.  y • 

Additionally,  the  calculations  for  an  optimum  design  will  be  related 
to  a fixed  standard  of  a rectangular  beam  with 

(159)  a » y 

and  its  corresponding  flutter  load 


(160) 


Qcrit. 

rect. 


3 q£ 

4 Eb  * 


Thus,  the  method  of  fixing  the  flutter  load  will  also  include  determining  an 
equivalent  rectangular  beam  with  the  same  prespecified  flutter  load  and  using 
it  and  its  dimensions  as  a reference.  In  terms  of  this  rectangular  reference 
beam  and  realizing  that  equation  (158)  must  hold,  an  added  restriction  upon  y 
is  that 


(161)  Prect.  ^ 

where  yrect.  is  now  the  value  of  y for  the  rectangular  reference  beam.  Thus, 

from  equations  (157),  (158),  and  (161)  the  constraints  to  be  used  in  the 
optimization  routine  are 

062)  Vfect. 


(163)  0 < a < y . 
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10.  DISCUSSION.  As  a preliminary  to  the  mass  optimization  work,  some 
results  will  now  be  shown  for  the  stability  of  Hauger's  problem.  The  various 
effects  of  internal  and  external  damping  on  flutter  load  will  be  carefully 
considered. 

Figure  3 shows  the  variation  of  external  damping,  q,  versus  flutter  load, 
Q,  for  various  fixed  values  of  internal  damping,  £,  with  internal  damping  y=0. 
For  each  fixed  value  of  internal  damping,  £,  the  general  tendency  is  for 
flutter  load  to  increase  as  external  damping,  q,  increases  until  a flutter 
load  of  approximately  Q x l(r  = 11.0  is  reached_;  at  this  time  there  is  a 
decrease  in  flutter  load  as  external  damping,  q,  further  increases  with  an 
asymptotic  value  of  Q x 10^  = 8.3  eventually  being  reached.  Notice  that  as 
the  fixed_values  of  internal  damping,  £,  increase  that  the  values  of  external 
damping,  q,  at  which  flutter  load  reaches  a maximum  also  increase.  Also  note 
that  from__this  last  result  it  would  appear  that  for  a given  value  of  external 
damping,  q,  a maximum  flutter  load  can  be  obtained  by  making  a proper  choice 
of  internal  damping,  £. 

Figure  4 now  shows  the  variation  of  internal  damping,  £,  versus  flutter 
load,  Q,  with  external  damping,  q,  now  taking  on  fixed  values  - the  other 
internal  damping,  y,  is  again  zero.  As  internal  damping,  K,  increases, 
flutter  load  begins  at  a value  of  Q x 10^  = 8.3  and  increases  to  a maximum 
of  approximately  Q x ICr  = 11.0  and  then  continually  decreases  for  each  of 
the  fixed_values  of  external  damping,  q.  As  fixed  values  of  external 
damping,  q,  increase,  the  values  of  internal  damping,  £,  at  which  flutter 
load  reaches  a maximum  also  increase.  For  a given  value  of  internal 
damping,  £,  a maximum  flutter^  load  can  be  obtained  by  making  a proper 
choice  of  external  damping,  q. 

Figure  5 shows  the  variation  of  flutter  load,  Q,  with  internal  damping, 

X»  for  fixed  values  of  internal  damping,  £,  for  external  damping  set  at 
q = 1.0.  For  some  of  the  higher  values  of  fixed  values  of  internal  damping, 

£,  there  is  a general  tendency  for  flutter  load  to  increase  as  internal 
damping,  y,  increases  to  a maximum  in  the  range  10.8  < Q x 10^  £ 11.4  and 
then  to  gradually  decrease.  For  some  of  the  lower  values  of  internal 
damping,  £,  flutter  load  starts  out  at  its  maximum  at  internal  damping 
y=0.0  and  then  decreases  as  internal  damping,  y,  increases.  A maximum 
flutter  load  for  a pre-specified  internal  damping,  y,  can  be  obtained 
through  a proper  choice  of  internal  damping,  £.  As  fixed  values  of 
internal  damping,  increase,  the  values  of  internal  damping,  y,  at 
which  flutter  load  reaches  a maximum  also  increase,  but  unlike  the  trend 
in  figures  3 and  4,  the  value  of  this  maximum  tends  to  vary  slightly. 

Figure  6 shows  the  variation  of  flutter  load,  Q,  with  internal  damping, 

£,  for  fixed  values  of  internal  damping,  y,  with  external  damping  set  at 
q = 1.0.  For  the  various  fixed  values  of  internal  damping,  y,  it  is  clear 
that  flutter  load  increases  as  internal  damping,  £,  increases  up  to  a 
maximum  of  Q x 10^  = 11.4  or  less  and  then  flutter  load  decreases  with 
further  increases  in  internal  damping,  £.  As  fixed  values  of  internal 
damping,  y»  increase,  the  values  of  internal  damping,  £,  at  which  flutter 
load  reaches  a maximum  also  increases,  but  the  value  of  the  maximum  varies 
somewhat. 
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Figure  7 shows  the  variation  of  base  thickness  y with  tip  thickness 
a for  internal  dampings  y=  1000.  and  £=  1.0  and  for  external  damping 
T = 1.0  where  the  curves  themselves  are  for  various  fixed  values  of  n» 
the  order  of  the  generalized  parabola  as  given  in  equation  (132).  In 
Figure  7,  the  constraint  y < .05  as  expressed  in  equation  (156)  is 
utilized  and  will  be  held  to  in  all  future  plots.  Curves  for  n = 1,2, 3, 4 
are  shown  and  it  is  clear  that  base  thickness  y increases  as  order  n 
increases  or  as  tip  thickness  a decreases.  The  results  of  Figure  7 
may  be  directly  applied  to  calculate  the  mass  ratio 

(164)  M/Mq  = a + (y-a)/(n+l) 


where 


(165)  MQ  = 2 p i2  b 

as  equations  (164)  and  (165)  directly  result  from  equations  (135)  and  (138). 
Figure  8 shows  the  variation  of  mass  ratio  M/M  for  generalized  parabola's 
of  order  n = 1,2, 3, 4.  It  is  clear  that  the  minimum  mass  ratio  will  occur 
for  zero  tip  thickness,  a = 0,  which  would  yield  a beam  with  a knife  edge 
at  the  tip.  Thus,  in  any  practical  applications  it  would  be  wise  to  specify 
a minimum  tip  thickness  as  a lower  constraint  on  a.  It  is  further  clear 
from  figure  8 that  different  orders  of  generalized  parabolas  yield  the 
lower  value  of  mass  ratio  M/M0  for  different  values  of  a.  For  example, 
n = 2 gives  the  lowest  mass  ratio  in  the  range  0 £ a £ .00825,  n = 3 
gives  the  lowest  mass  ratio  in  the  range  .00825  < a <_  .00915,  and  n = 4 
gives  the  lowest  mass  ratio  in  the  range  .00915  <_a  £ .0167.  It  is  also 
clear  that  if  a beam  with  a knife  edge  were  allowed,  a beam  with  a mass 
reduction  in  excess  of  24.5%  would  be  possible. 

Now,  as  an  example  of  a true  mass  optimization  with  a lower  constraint 
on  tip  thickness  the  beam  represented  in  figures  7 and  8 will  be  optimized 
with  a Rosenbrock  algorithm  [9].  For  a lower  bound  of  a = .000833  the 
Rosenbrock  algorithm  yields  an  optimum  design  at  this  value  of  a of  a mass 
ratio  of  M/MQ  = .766  corresponding  to  a base  thickness  y = .0378  for  a 
generalized  parabola  of  order  n - 2.1.  Thus,  for  this  constrained  problem 
a mass  reduction  of  23.4%  is  possible. 

Figure  9 shows  the  variation  of  base  thickness  y with  tip  thickness 
a for  internal  dampings  y = 100.  and  £ = 1.0  and  for  external  damping 
tr  = 1.0  for  generalized  parabolas  of  order  n = 1,2, 3, 4.  Figure  10  shows 
the  corresponding  variation  of  mass  ratio  M/M0  with  tip  thickness  a. 

Clearly,  n = 2 gives  the  lowest  mass  ratio  in  the  range  0 £ a < .00565, 
n = 3 gives  the  lowest  mass  ratio  in  the  range  .00565  <a  < .0C745,  and 
n = 4 gives  the  lowest  mass  ratio  in  the  range  .00745  <_a  <_  .0167.  Notice 
that  for  low  values  of  a the  curves  for  n = 3 and  n = 4 give  much  higher 
mass  ratios  and  then  from  figure  9 eventually  overshoot  the  constraint 
that  y < .05.  A beam  with  a knife  edge  would  result  in  a mass  reduction 
of  at  least  27.5%;  in  this  case  a beam  with  a finite  tip  thickness  would 
result  in  a better  mass  reduction  of  at  least  27.6 % and  indicates  at 

least  one  case  where  the  knife  edge  condition  is  not  the  minimum. 
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For  a lower  bound  of  a = .000833  the  Rosenbrock  algorithm  yields  an  optimum 
design  at  this  value  of  a of  a mass  ratio  of  M/M0  = .709  corresponding  to  a 

base  thickness  y = .0308  for  a generalized  parabola  of  order  n = 1.73  and 

hence  a mass  reduction  of  29.1%  for  this  constrained  problem. 

Figure  11  shows  the  variation  of  base  thickness  y with  tip  thickness  a 
for  internal  dampings  y = 1000  and  £ = 10.0  and  for  external  damping  ^=1.0 
for  generalized  parabolas  of  order  n = 1,2, 3, 4.  Figure  12  shows  the 
corresponding  variation  of  mass  ratio  M/M0  with  tip  thickness  a for  the 
parameters  specified  in  figure  11.  Note  that  n = 2 gives  the  lowest  mass 
ratio  in  the  range  0 £ a £ .0083,  n = 3 gives  the  lowest  mass  ratio  in  the 
range  .0083  £ a £ .0094,  and  n = 4 gives  the  lowest  mass  ratio  in  the 

range  .0094  £ a <_  .0167.  In  this  instance  a beam  with  a knife  edge  would 

result  in  a mass  reduction  of  at  least  24.8%.  For  a lower  bound  of 
a = .000833  the  Rosenbrock  algorithm  yields  an  optimum  design  at  this 
value  of  a of  a mass  ratio  of  M/M0  = .765  for  a generalized  parabola  of 
order  n = 2.1  and  hence  a mass  reduction  of  23.5%  for  this  constrained 
problem. 

Figure  13  shows  the  variation  of  base  thickness  y with  tip  thickness 
a for  internal  dampings  y = 1000  and  £ = 10.0  and  an  external  damping  of 
q"  = 10.0  for  generalized  parabolas  of  order  n = 1,2, 3, 4.  Figure  14  shows 
the  corresponding  variation  of  mass  ratio  M/M0  with  tip  thickness  a. 

Clearly  n = 2 gives  the  lowest  mass  ratio  in  the  range  0 £a  < .0057, 
n = 3 gives  the  lowest  mass  ratio  in  the  range  .0057  £ a £ .0074,  and 
n = 4 gives  the  lowest  mass  ratio  in  the  range  .0074  £ ot  £ .0167.  A 
beam  with  a knife  edge  would  result  in  a mass  reduction  of  at  least  27.4% 
with  slightly  lower  values  possible  for  a blunt  end.  For  a lower  bound 
of  a = .000833  the  Rosenbrock  algorithm  yields  an  optimum  design  at  this 
value  of  a of  a mass  ratio  of  M/M0  = .708  corresponding  to  a base  thick- 
ness of  y = .0309  for  a generalized  parabola  of  order  n = 1.726  and  hence 
a mass  reduction  of  29.2%  for  this  constrained  problem. 

Figure  15  shows  the  variation  of  base  thickness  y with  tip  thickness 
a for  internal  dampings  y = 500  and  £ = 10.0  and  an  external  damping  of 
■q  = 10.0  for  generalized  parabolas  of  order  n = 1,2, 3, 4.  Figure  16  shows 
the  corresponding  variation  of  mass  ratio  M/M0  with  tip  thickness  a.  In 
this  case  n = 2 gives  the  lowest  mass  ratio  in  the  range  0 £a  < .00475, 
n = 3 gives  the  lowest  mass  ratio  in  the  range  .00475  £a  £ .00^5,  and 
n - 4 gives  the  lowest  mass  ratio  in  the  range  .00665  £ a £ .0167.  A 
beam  with  a knife  edge  would  result  in  a mass  reduction  of  at  least  31.5% 
and  one  with  a tip  thickness  of  a = .0004  would  result  in  a mass  reduction 
of  at  least  32%,  but  this  is  still  a very  small  tip  thickness.  For  a 
lower  bound  of  a = .000833  the  Rosenbrock  algorithm  yields  an  optimum 
design  at  this  value  of  a of  a mass  ratio  of  M/M0  = .682  corresponding 
to  a base  thickness  y = .0325  for  a generalized  parabola  of  order  n = 1.97 
and  hence  a mass  reduction  of  31.8%  for  this  constrained  problem. 
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TABLE  1.  SUMMARY  OF  ROSENBROCK  OPTIMIZATION  RESULTS 


CASE 

Y 

K 

q 

axlO4 

1 

1000 

1.0 

1.0 

8.33 

2 

100 

1.0 

1.0 

8.33 

3 

1000 

10.0 

1.0 

8.33 

4 

1000 

10.0 

10.0 

8.33 

5 

500 

10.0 

10.0 

8.33 

% 


V 

n 

M/M0 

DECREASE 

.0378 

2.1 

.766 

23.4 

.0308 

1.73 

.709 

29.1 

.0378 

2.1 

.765 

23.5 

.0309 

1.726 

.708 

29.2 

.0325 

1.97 

.682 

31.8 

Table  1 summarizes  the  results  of  the  Rosenbrock  optimizations  that  were 
previously  discussed  in  figures  7 thru  16.  It  is  clear  that  from  the  five 
sets  of  parameters  for  which  an  optimization  was  made  that  reductions  in  mass 
of  from  23.4%  to  31.8%  are  possible  with  a tip  thickness  constraint  of  .000833 
specified. 


11.  CONCLUSIONS.  An  adjoint  variational  principle  has  been  applied  to 
the  flutter  stability  problem  of  Euler  beams  with  both  internal  and  external 
damping.  An  application  of  a generalized  Ritz  approximation  and  the  applica- 
tion of  a variational  principle  resulted  in  a characteristic  equation  for 
flutter  load.  Hauger's  problem  was  then  optimized  within  the  bounds  of  a 
special  class  of  parabolic  shape  functions. 

Preliminary  studies  of  the  variation  of  internal  dampings  and  external 
damping  for  the  stability  problem  indicated  suitable  ranges  of  these  para- 
meters for  use  in  the  optimization  problem.  The  various  plots  of  mass  ratio 
versus  tip  thickness  yielded  a vital  insight  into  what  should  be  expected 
from  an  optimization  routine.  It  was  quite  clear  that  a minimum  mass  ratio 
was  obtainable  in  each  case  for  a knife  edge  at  the  beams  free  end,  but  this 
condition  is  impractical.  Thus,  a minimum  tip  thickness  constraint  must  be 
applied  to  any  optimization  problem  to  obtain  realistic  results.  In  going 
to  a Rosenbrock  optimization  routine  with  a minimum  tip  constraint  of 
a = .000833  it  is  quite  clear  that  mass  ratio  reductions  of  anywhere  from 
23.4%  to  31.8%  are  possible. 
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FLUTTER  LOAD  - Q x 10 


EXTERNAL  DAMPING  - q 


Figure  3.  Variation  of  Flutter  Load  Q with  External  Damping  q for  Fixed  Values  of 
Interval  Damping  £ with  Y=0-0  for  Uniform  Cross-Section 


FLUTTER  LOAD  - Q x 10' 


INTERNAL  DAMPING  - £ 

Figure  4.  Variation  of  Flutter  Load  Q_with  Internal  Damping  5 for  Fixed 

Values  of  External  Damping  q with  7=0.0  for  Uniform  Cross-Section. 


Figure  5.  Variation  of  Flutter  Load  Q with  Internal  Damping 
Y for  Fixed  Values  of  Internal  Damping  £ with 
d|  = ] .0  for  Uniform  Cross-Section. 


BASE  THICKNESS 


MASS  RATIO  - M/M, 


TIP  THICKNESS  - 0/ 


Figure  10.  Variation  for  Mass  Rati£  with  Tip  Thickness 
for  y=100.,  5=1.0,  and  q=1.0. 


u> 


TIP  THICKNESS  -CX 


Figure  11.  Variation  of  Base  Thickness  with  Tip  Thickness 
for  y=l 000 , £=10.,  and  q=1.0. 


BASE  THICKNESS 


TIP  THICKNESS  - 0 C 


Figure  13.  Variation  of  Base  Thickness  with  Tip  Thickness 
for  y=1000,  £=10.,  and  q=10. 


MASS  RATIO  - M/M 


Figure  14. 


Variation  of  Mass  Ratio_with  Tip  Thickness 
for  y=1000,  5=10.,  and  q=10. 


Figure  15. 


Variation  of  Base  Thickness  with  Tip  Thickness 
for  y=500,  5=10.,  and  q=10. 


MASS  RATIO  - M/M 


Figure  16,  Variation  of  Mass  Ratio  with  Tip  Thickness  for 
y=500  , 5=1 0.  , and  q = l 0. 


Buckling  of  Orthotropic  Rectangular  Cylinders* 

Earl  C.  Steeves 

U.S„  Army  Natick  Laboratories 

Natick,  Massachusetts 

One  of  the  loading  situations  that  must  be  considered  in 
the  design  of  packages  is  that  resulting  from  stacking.  Since 
this  is  a compressive  loading,  the  buckling  mode  of  failure  must 
be  considered.  To  obtain  some  guidance  for  this  design  problem, 
solutions  to  the  buckling  problem  for  rectangular  cylinders  were 
obtained.  These  solutions  were  obtained  by  direct  solution  of 
the  differential  equations  and  by  approximate  solution  using  the 
Rayleigh-Ritz  procedure.  Both  solutions  are  based  on  classical 
orthotropic  plate  theory  and  treat  the  corners  by  conditions  of 
continuity.  The  direct  solution  which  is  restricted  to  the 
uniform  load  results  in  a transcendental  eigenvalue  problem  for 
the  buckling  load  and  is  solved  by  a search  and  interpolation 
scheme.  The  approximate  solution  utilizes  finite  difference 
approximations  in  conjunction  with  the  Rayleigh-Ritz  analysis 
and  is  applicable  to  both  the  uniform  and  nonuniform  load 
situations.  The  constraint  equations  describing  the  corner 
conditions  are  incorporated  in  the  Rayleigh-Ritz  procedure  by 
a technique  not  commonly  employed.  The  resulting  algebraic 
eigenvalue  problem  is  solved  using  the  Jacobi  reduction  proce- 
dure. Results  from  the  direct  and  approximate  solutions  are 
compared  for  the  uniform  load  case  and  results  for  the  non- 
uniform  load  case  obtained  using  the  approximate  solution  are 
compared  with  experimental  results  for  rectangular  fiberboard 
cylinders . These  comparisons  with  experiment  indicate  a need 
to  examine  the  nonlinear  buckling  problem  and  to  include 
plasticity  effects  in  the  case  of  fiberboard. 


♦This  paper  will  be  published  as  U„S.  Army  Natick  Development 
Center  Technical  Report  75-99  entitled,  "Theoretical  and 
Experimental  Investigation  of  the  Buckling  of  Rectangular 
Cylinders". 
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ENERGY  RELEASE  RATE  IN  TERMS  OF  COMPLEX  ANALYTIC  FUNCTIONS 


M.  A.  Hussain  and  S.  L.  Pu 

Benet  Weapons  Laboratory 
Watervl i et  Arsenal 
Watervliet,  New  York  12189 


ABSTRACT . The  energy  release  rate  $ for  two  dimensional 
cracks  subjected  to  either  plane  loading  or  anti-plane  shear  is 
most  commonly  computed  by  Irwin's  crack  closure  method  or  the 
path  independent  integrals  used  by  Rice.  In  terms  of  analytic 
functions  of  a complex  variable,  these  two  approaches  give  two 
distinct  expressions  for  the  energy  release  rate.  For  instance, 
for  plane  problem  the  energy  release  rate  is  given  in  terms  of 
one  analytic  function  using  Irwin's  approach  while  the  same  is 
expressed  in  terms  of  two  analytic  functions  by  the  method  of 
path  independent  integrals.  For  special  cases,  such  as  Mode  I 
crack,  these  two  expressions  lead  to  identical  results.  How- 
ever, the  identity  for  the  general  case  has  not  been  established. 
In  this  report  we  have  shown  that  the  expression  of  § for  plane 
problem  in  terms  of  two  analytic  functions  can  be  further  reduced 
to  the  same  expression  obtained  by  Irwin's  method  in  terms  of 
only  one  analytic  function  and  that  the  two  distinct  expressions 
of  $ for  anti-plane  problems  are  identical. 


1.  INTRODUCTION.  The  energy  release  rate  $ , for  two  dimen- 
sional crack  problems  subjected  to  either  plane  loading  conditions 
or  longitudinal  shear  can  be  computed  by  various  methods.  The, 
most  common  ones  are  the  'crack  closure'  approach  of  Irwin  [1] 
and  path  independent  integrals  used  by  Rice  [2],  The  equivalence 
of  these  two  methods  is  rigorously  derived  in  [2],  [3],  and  [4]. 

Such  crack  problems  are  usually  formulated  in  terms  of  one 
or  two  analytic  functions  of  a complex  variable.  Two  distinct 
expressions  for  $ are  obtained  in  terms  of  analytic  functions  by 
these  two  methods.  For  self  similar  crack  extensions  for  which 
these  analytic  functions  can  be  explicitly  derived,  these  two 
different  approaches,  without  any  ambiguity,  lead  to  identical 
results.  However  for  non  self  similar  situations,  for  example 
in  branching  of  a crack,  one  must  be  very  careful  in  regard  to 
the  use  of  path  independent  integrals  in  calculating^.  In  such 
a case  it  is  necessary  to  evaluate  the  path  independent  integrals 
around  a contour  just  surrounding  the  tip  of  the  branched  crack 
and  not  around  the  entire  end  of  crack  involving  the  main  crack 


^Numbers  in  brackets  designate  References  at  end  of  the  paper. 
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and  the  branched  crack,  (see  e.g.  [5]).  Using  this  approach  it 
is  shown  in  this  paper  that  the  path  independent  integrals  and 
the  "crack  closure"  approach  lead  to  identical  results  in  terms 
of  analytic  functions.  This  equivalence  is  shown  by  the  use  of 
mapping  techniques. 


2.  COMPUTATION  OF  § USING  IRWIN'S  APPROACH.  Considering  two 
analytic  functions  <|>-|(z),  ^(z)  of  a corrTpl  eTvariabl  e z = x+iy,  the 

stresses  and  displacement  for  the  plane  problem  can  be  written  as 

[6]: 

(1  ) 0X  + CTy  = 4 Re{<^  ( z ) > 

(2)  ay  - ax  + 2i  Txy  = 2(7^' (z)  + ^(z)) 


2 u ( u + i v ) = k 4>  -,  ( z ) - z d>-J  ( z ) - ipi(z) 

(3)  11  - 

i / (Xn  + iYn)ds  = d>-|  ( z ) + z <J>-|(z)  + xp  i ( z ) 


For  the  anti-plane  problem,  using  F(z)  as  an  analytic  function,  we 
have  the  stresses  and  displacement  : 

(4)  xxz  - i xyz  = y F1 (z) 

(5)  w = Re{F(z ) } 


It  is  convenient  to  use  a mapping  function  such  that  the  crack 
contour  and  its  exterior  are  mapped  onto  a unit  circle  and  its 
exterior  respectively.  Let  such  a mapping  be  denoted  by  z=w(c). 
Denoting  the  tips  of  such  a crack  by  z^  and  their  images  in  the 

C-plane  by  such  mapping  requires  a)'UA)=0  [5].  Hence  we  have 

the  following  expansion  near  z^ 

(6)  z-z^  = u>(c)  - uj(ca)  = y(c-C^)2w'  ' (zz)  + 
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Using  the  near  field  Westergaard  solution  of  the  tips  of  the 
crack  we  have,  using  the  usual  notations,  with  R+0 


(7)  lim  (a  +a  ) = Re{  K/2/W  e" 1 0/2 } , K = Kj  - 1 Kn 

Rt-0  y 

Let  the  inclination  of  the  crack  tip  to  the  x-axis  be  6 (Fig.  1), 
we  have  from  (6) 

(8)  R e1  (0+<s ) = z-z^  = ^-(c-c^w"^)  + 0(e-?A)3 


and 

(9)  w'U)  = (?-£*)«"  (5A)  + 0(c-^)2 

From  (1)  with  <J>-J  ( z ) = <\>'  (c)/u'  (?) , <^(z)  = ' (?)/“*  ( c)  etc.  and 

using  (8),  (9)  we  have 

(10)  lim  (ox+0y)  = 4 Re(4>'  (c^)[2R  w'  ' ( C5,)ei  6]  ^2  e 1 ° ^ 2 } 


Comparing  (7)  and  (10)  for  all  9,  we  have 

(11)  K = Kj  - iKn  = 2 tt1/2  (f)'  (C£)//e1<5w'  ' U*) 


After  some  manipulation  using  F'(z)  = f'(c)/w'(?)  we  get  the 
corresponding  expression  for  the  anti-plane  problem: 


(12)  Km  = ip  f 1 1 (Tjt) 


These  expressions  may  readily  be  available  in  literature.  It 
will  be  seen  that  <5  plays  a crucial  role  in  path  independent 
integrals.  Following  Irwin's  approach  we  obtain  the  energy 
release  rates: 


(13) 


47T  <t>  1 U&)  <{>  ' Uft) 

7 (B"(?p  u"(?J))1/2 


(for  plane  stress) 
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04) 


(for  anti -plane) 


yir  f,2(?£) 

2 e"l6w' ' (cA) 


3.  COMPUTATION  OF  USING  PATH  INDEPENDENT  INTEGRALS.  Using 
path  independent  integrals  the  following  expressions  "have  been 
derived  by  Budiansky  and  Rice  [7],  for  an  open  contour: 


B B 

(15)  J-j+iJ^  = - { / <t>-|2(z)dz  - 2 / $1  (z  )i|n  (z  )dz  - 

E A A 1 


~ o B 

- Cz  <Mz)2]  } 

A 

1 B o 

(16)  J1  - id2  = 7 V I ( F ' (z)  )2dz 

2 A 

for  plane  and  anti-plane  problems  respectively  (the  term  in  square 
bracket  is  the  jump  term).  Since  the  crack  tip  is  inclined  at  an 
angle  5 to  the  x-axis,  we  need  the  above  integrals  for  an  inclined 
coordinate  system.  Rotating  the  coordinate  axes,  counterclockwise 
through  an  angle  6 and  denoting  the  transformed  quantities  with 
superscript  (1),  we  have  for  the  plane  problem  [6] 


® ( 1 ) ( z ( 1 ^ ) = $ ( z ^ ^ e 1 5 ) 

(17)  fO>(z(l>)  = e2i6T(z^  ^e1'6) 

where  « (z ) = ^ (z ) , V(z )=^-j  (z  ) etc.  Using  (17)  in  (15)  we  have 

(18)  J j ^ + i J ^ ^ = e ” ^ 5 ( 0 1 + i J 2 ) 

Similarly  for  the  anti-plane  problem  we  have 

(19)  F(1)‘(z(1))  = ei6  F'(z(1)ei<S) 
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and  (19)  into  (16)  gives 


(20) 


(D 


- i J 


o)  . 


= e 


(J-j  - i J2) 


It  is  easy  to  show  that  the  above  integrals  remain  invariant 
under  translation  of  coordinate  systems  and  hence  without  loss 
of  generality  we  make  an  additional  assumption  that  one  of  the 
crack  tips  is  located  at  the  origin  of  the  z-plane,  i.e., 
z^=w(c£)  = 0.  Using  the  mapping  function  described  previously 

equations  (15)  and  (16)  become 


(21) 

and 

(22) 


J-|  + i 02 


J1  " i J2 


2i  b<p ' 2 ( C ) 

{{/  d5 

E a ai*  (c) 


1 b f'2U) 

J vf  — — d? 

2 a w'U) 


b<t>'  UH'  (?) 

2 f d?  - 

a w1 (_?) 

<t>'(z)  2 b 

M?)  ( ) ] > 

w'  U)  a 


where  a,  b in  the  s-plane  corresponds  to  A and  B on  the  z-plane, 
in  the  limit  as  r-*-0  (see  Fig.  1).  Now  making  use  of  (6)  with 
Zji=o)(Zji)=Oi  (location  of  the  crack  tip  at  the  origin  of  the 
z-plane)  we  see  that  V(?5,)  is  finite  (see  eq.  (31)).  Equations 
(21),  (22)  can  be  integrated  using  a circular  contour  of  vanishing 
radius  around  (Fig.  1) 


(23) 


J 1 + i J 2 


2tt  r<f>'2(?<>) 


<t>'  (?*)  \"l 


1 (?0) 


JJ 


(plane) 


Note  that  the  jump  term  in  equation  (21)  vanishes.  The  correspond- 
ing result  for  anti-plane  problem  becomes: 

(24)  J]  - 1 J2  a - J y f ,2UA)/a'  ' U*) 

Rice  has  shown  [2]  that  the  energy  release  rate  along  the  plane 
of  the  crack  is  given  by  Jj1).  Hence  we  have 
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(25) 


9 


(D 


2tt 

= — Re 
E 


-i6 


,6( 


*^u() 


a)l  ’ (^) 


+ 2 


4> ' U&) 


U) 


I I 


(Co) 


)' 


(26) 


t -A" 


I „ f f'2(^  + f'2(tt)  ^ 

4 U e 16  u"(Cjl)  e16  «"(?*)' 


for  the  plane  and  anti-plane  problems  respectively.  In  the  sequel 
it  will  be  proved  that  (25)  and  (26)  reduce  to  (13)  and  (14) 
respectively. 


4.  EQUIVALENCE  OF  TWO  DISTINCT  EXPRESSIONS  FOR  ff.  Without 
loss  of  generality  a traction  free  boundary  condition  is  assumed 
on  the  crack.  Since  z j^=u)(zS/)  = 0 we  have  for  z near  z% 

(27)  LiU)  = jU-^^)2®'  1 (zz)  + 0(z-zz)3 

(c)  s (C-Ca)u*  1 (ca)  + OU-S^)2 

From  (1)  and  (2)  and  using  pol ar  . coordi nates  [6],  the  boundary 
condition  on  the  unit  circle  a=eie  in  the  z plane  becomes,  (with 
$(cr)=4> 1 ( cr ) / co 1 (a) ) 

4>‘  (cr)  <f> 1 (a)  a2  

(28)  crr  - iTrfl  = + , - . {w(a)  $ 1 ( a ) +ifi 1 (a)}  = 0 

r0  (o’(ff)  a)'(a)  co'(a) 

Multiplying  by  a)'(cr),  which  has  a zero  at  ^=Z^>  we  have. 


cj 1 (a)  « w(a) 

(29)  4> ' ( a ) + — 4>'(cr)  - <f>,l(a)  + 

co 1 (a)  a)'  (a) 


?o)(a)  co1  1 (a)  7b)'  (a) 

+ a z=  (a)  - a ip ' ( a ) = 0 

co 1 (a)  to'  (a)  co'  (a) 
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Substituting  (27)  in  (29)  and  approaching  the  limit  we 

have,  (since  o and  are  both  on  the  unit  circle): 


(30) 


2 v(^) - q 


03 


‘Up) 


*'U0)  + Sjf 


0) 


' (Co) 


03 


(t.) 


•*'(?,)  * 0 


Solving  for  xp * ( ^ £ ) we  have 


, , w 1 1 ( c ^) 

(31  ) ip‘  (zz)  = “2  (c^)  " 4 <l> 1 ( CA) 

2 W ^ 

Since  is  on  the  unit  circle  we  write  c^=e1a^.  From  (8)  it  is 

clear  that  the  top  and  bottom  lines  of  the  crack  form  a tangent 
to  the  unit  circle  at  in  the  c-plane.  Consider  a point  on  the 
bottom  line  close  to  the  crack  tip,  point  A in  Fig.  1,  9 = -tt, 

(as  r+0)t  equation  (8)  yields 

(32)  R ei6  = \ r2^w"(^) 

Taking  complex  conjugate  of  (32)  and  eliminating  R and  r we  have 


(33)  e16  - 

Substituting  (31)  and  (33)  into  (25)  and  rearranging  we  get 


- 2 4> 1 Ufl)  4> 1 Up 


+ 


♦ l2Ut) 
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In  which  the  first  term  is  real  and  the  second  term  is  imaginary, 
hence 


(34) 


4 n 4»‘  (CA)  <f>'  (C£) 


which  is  the  same  as  equation  (13)  obtained  by  Irwin's  approach. 
**  Equation  (31)  indicates  that  VU$,)  is  finite.  This  primarily 
is  due  to  our  assumption  that  the  crack  tip  is  at  the  origin  on 
z-plane  i.e.,  z^=a)(z^)  = 0 . (See  appendix  for  a general  case). 


For  the  antiplane  problem,  using  cylindrical  coordinates, 
we  have 


(35)  xrz  - i t0z  = u F 1 ( z ) e 1 0 


Again  assuming  a traction  free  boundary  condition  in  the  vicinity 
of  the  crack  tip  we  have 

Im{y  F'(z)ei6}  = — { e 1 6 F ' ( z ) - e_i6  F * ( z ) > = 0 

21 

and  using  the  mapping  function  (i.e.,  F ' (z  ) = f 1 ( z,  )/w 1 ( c) ) we  have 
(36)  ei6  f'(c)/«'(c)  = e"1'6  -f 1 (c)/w'  (c) , 4=a=ei0 


In  the  limit  as  using  (27)  and  (33  ),  (36)  becomes 


(37)  f ' = - f 1 U%) 


Using  (37)  and  (33)  into  (26)  we  get 


(38) 

which 


to 


is  the  same  as 


UTT  f ' 2 ( C 5,  ) 

2 e"lS 

equation  (14)  obtained  by  Irwin's  approach. 
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APPENDIX 


In  the  text  we  have  located  one  of  the  crack  tips  at  the 
origin  of  the  z-plane.  This  leads  to  finiteness  of  \p' (zz)  for 
the  plane  problem.  For  an  arbitrary  location,  ^'(Cj,)  involve 

singularity  and  the  integrals  together  with  the  jump  term 
become  singular  as  we  take  the  contour  in  the  c-plane  of 
vanishing  radius.  Here  we  show  the  annihilation  of  these 
terms . 


Using  Taylor's  expansion  around  the  crack  tip  c^  with 
a)' (Cjji)  = 0 we  have  (referring  to  equation  (27)) 


(A-l) 


l»U)  = u(Cjj)  + jU-Cj)2 * *!*)"  (?t)  + J-U-SjjV'  ' (;*)  +••' 


03 


•(c) 


(-M 

u>'(c) 


{1  + c-j  ( C- C&)  + c2(t-Zz)2  +“*} 

«"(?£)(?-?£) 

1 -.0 

= , .2T-  w 7i  {1  + 2 + +" 

W* '^(?£)(c-c£) 


where  c^  = - (1  /2  )w'  1 1 ( c £ ) /oo 1 1 ( tz)  etc. 

Contour  integrations  and  the  jump  terms  are  given  by  (Fig.  1) 


( A - 2 ) 


b dc 

/ = ‘mi  , 

a ?-?£ 


b dc 

l 1^7 


2 C o b dc 

— . J 3 = 0 

r a (c-C£) 


^)b  • \ 
?-?j , a r 


, 1 \b 

l: 72)  - 0.  = re 


i 6 


(C-C£)  a 


From  boundary  condition  (29)  we  have  (a=e10  is  on  the  unit  circle) 


(A-3 ) 


^ 1 (a)  = -5- 


1 to'  (a) 


1 


O 

a oj  1 (a) 


4>‘  (<*)  + -=■  4> ' (a)  - 


to(a) 


0) 


'(a) 


<f> * ' (cr)  + 


w(a)  w1 ' (a) 

+ t <J>'(a) 


03 


2(cr) 
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Substituting  (A-l)  in  (A-3)  and  using  analytic  continuation  we 
have 


1 w(Sj»)  * 

( A- 4 ) 11m  *'(c)  = <f>'(c«)  + r (C) 

U-?*)2  «"(?*> 

where  \p’  (c)  is  regular  at  Hence 


2<f>  (c)^'  (?)  2w(?a) 

(A-5)  11m— = « 3 

u'(c)  w"^(c4)  (C-C*,) 

<♦*  (Cjt>  + [♦"(c*)  + Cl  4* ' (Cji)](c-C£)  + o U-?A)2} 


As  can  be  seen  from  (A-5)  and  (A-2),  (21)  has  singular  terms  as 
r+0  in  the  second  integral  given  by 


(A-6 ) 


2 W ( C n ) O 


Considering  now  the  complex  conjugate  of  the  jump  term  in  (21), 
we  have 


(A-7) 


__c|>'2(a)  i 2 2co  ( e £ ) 

w(a) — » 0 ( — — ) + 


W 


2(c) 


5-Ct 


w 


. .2 


(C0) 


[♦'  UjH'  ' (CA)  + c!  ' 2 ( C ^)  ] 


+ • * * 


From  (A-7)  and  (A-2)  we  have  the  jump  term  from  (21), 

-»lZU)lb  2C(.  2 <,(1^) 


(A-8) 


11m 

r^O 


«(0 


«'2U)Ja  r w,,z(C£) 

U'  (CA)4» ' ' (Ca)  + ci^* 2(cJt)3 


which  is  the  negative  of  (A-6)  and  hence  leads  to  annihilation  of 
the  singular  terms  in  (21), 
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Figure  1.  Mapping  and  contours  of  integration.  In  the  limit 
as  the  contours  shrink,  3=a  and  6=b  approach  the 
images  of  A and  B on  the  c-plane. 


DEVELOPMENT  AND  APPLICATION  OF  DYNAMIC  MATHEMATICAL  MODELS 
FOR  EVALUATION  OF  MILITARY  SYSTEMS,  FORCES  AND  DOCTRINE 


Roger  F.  Willis 

US  Army  Combined  Arms  Combat  Developments  Activity 
Fort  Leavenworth,  Kansas 


ABSTRACT.  This  paper  starts  with  a review  of  alternative  combined 
arms  study  objectives  for  which  flexible  mathematical  models  are 
required  and  definition  of  the  characteristics  that  such  models 
should  possess.  Specific  mathematical  models,  involving  sets  of 
simultaneous  differential  equations,  and  their  solutions  (as  well  as 
indications  of  how  these  models  can  be  applied  to  real  problems)  are 
presented  in  the  following  areas: 

1.  tank  combat,  including  artillery 

2.  anti-tank  weapon  employment 

3.  air  defense  suppression 

4.  helicopter-air  defense  duels 

5.  support  weapon  employment  rates 

1.  This  paper  will  cover  a variety  of  classes  of  mathematical  models 
of  military  operations.  Before  doing  that  we  will  briefly  discuss  the 
types  of  studies  we  do  in  TRADOC  and  explain  how  mathematical  models 
fit  into  our  study  methodologies. 

2.  To  meet  combat  development  objectives  we  carry  out  many  different 
types  of  studies,  some  using  large,  expensive  war  games  or  computer 
simulations.  These  types  of  applications  are  listed  in  Table  1.  Ex- 
amples of  specific  studies  in  some  of  these  categories  are: 

Type  2.  Comparison  of  a US  division  supported  by  scatterable 
mines  with  a US  division  without  scatterable  mines. 

Type  4,  Comparison  of  the  new  XM-1  tank  with  the  standard 
M-60  tank. 

Type  5.  Evaluate  alternative  tactics  and  procedures  for 
air  defense  suppression. 
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Type  6.  Determine  the  optimum  mix  of  intelligence-gathering 
systems. 

Type  7.  Determine  range  requirements  for  artillery. 

Type  8.  Determine  the  numbers  of  mortars  of  each  caliber 
required  in  each  type  of  maneuver  battalion. 

Type  9.  Develop  effective  countermeasures  against  antitank 
missiles. 

Type  10.  Determine  the  impact  of  political  constraints  on 
tactical  nuclear  doctrine. 

3.  First  we  consider  the  specific  example  of  a type  2 study  --  a US 
division  without  scatterable  mines  (Force  A)  compared  with  the  same  US 
division  wi th  scatterable  mines  (Force  B).  Suppose  we  use  a large,  com- 
plex, computer-assisted  war  game,  play  two  games,  and  get  the  results 
summarized  at  Figure  1.  We  played  about  a day  of  combat  with  Force  A 
and,  starting  over  against  the  same  energy  force,  played  about  a day  of 
combat  with  Force  B.  Force  B appears  to  be  more  effective  based  on  the 
first  three  measures  of  effectiveness,  but  Force  A is  more  effective  if 
we  use  any  one  of  the  last  three  measures.  Thus,  as  we  would  expect,  the 
study  results  are  sensitive  to  the  measure  of  effectiveness  used,  A 
second  point  is  that  the  differences  measured  between  Force  A and  Force 

B might  not  be  large  enough  to  be  significant  (e.g.,  27  hours  compared 
with  23  hours). 

4.  These  aggregated  results  are  backed  up  by  a great  variety  of  detail 

that  can  be  studied  to  develop  insights  concerning  the  real  differences 
in  capabilities  between  Force  A and  Force  B.  In  particular  both  games 
are  based  on  hundreds  of  assumptions,  some  of  which  might  have  major 
impacts  on  the  combat  results.  This  raises  the  question:  How  sensitive 

are  the  results  to  variations  in  the  assumptions?  Some  of  the  assump- 
tions are  key  elements  in  the  scenario. 

5.  For  every  study  we  are  required  to  use  a standard  scenario.  This 
could  give  us  results  as  illustrated  in  Figure  2,  in  which  Force  A and 
Force  B are  not  significantly  different  against  the  standard  size  of 
enemy  attack.  The  measure  of  effectiveness  used  might  be,  for  example, 
the  percent  of  US  tanks  surviving  at  the  end  of  3 days.  If  we  use  a 

1 arqer  eneniy  force  and  re-run  both  the  Force  A game  (45%)  and  the  Force 
B game  (65%)  then  Force  B becomes  significantly  better  than  Force  A. 
Another  scenario  factor  is  the  amount  of  close  air  support  available  to 
the  US  force.  The  lower  half  of  Figure  2 shows  that  increasing  the 
amount  of  close  air  support  above  standard  still  leaves  Force  A about 
equal  to  Force  B in  effectiveness.  However,  reducing  close  air  support 
makes  Force  A significantly  more  effective  than  Force  B.  Thus  this 
scenario  factor  reverses  the  trend. 

6.  In  order  to  play  these  six  extra  games,  for  investigating  the 
sensitivity  of  study  results  to  only  two  factors,  might  take  15  people 
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„ * Tf  4,  rlear  that  we  need  more  efficient  ways  of  doing 

IsSHArsiss:  as  ISIS  &w 

a.  ground  combat 

b.  artillery 

c.  attack  helicopters 

d.  close  air  support 

e.  battlefield  interdiction 

f.  tactical’  nuclear 

g.  ai r defense 

h.  reserves 

i . repl acements 

j.  resupply 

k.  air  reconnaissance 

l . ground  surveillance 

m.  information  flow 

n.  ai rmobil e operations 

o.  engineer  operations 

p.  transportation 

q.  command  and  control 

7.  These  models  are  characterized  not  by^he  systems* that 

formulation  but  by  the  categories  of  m 1 t Y P analytical  sup- 

5JswrJ.H«ypof  ^ 

later. 

8.  The  first  example  lrjwSdlrecthfire),'and''artillery.  The 

four^fferential^equat  Ions' describing  this  model  appear  in  Figure  3. 

The  basic  assumptions  are: 

. Tanks  fire  at  tanks  and  artillery  Is  employed  against  tank  units 
and  also  ?n  counterbattery  fire  against  opposing  artillery. 
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b.  The  rate  at  which  each  side  loses  tanks  is  proportional  to  the 
number  of  tanks  employed  by  the  other  side  (and  still  surviving)  and 
proportional  to  the  number  of  artillery  tubes  the  other  side  is  using 
against  tanks. 

c.  The  rate  at  which  each  side  loses  artillery  is  proportional  to 
the  number  of  artillery  tubes  the  other  side  employs  against  artillery. 

9.  The  factors  in  this  model  are  defined  as  follows: 

B(t)  = number  of  Blue  tanks  surviving  at  time  t 

R(t)  = number  of  Red  tanks  surviving  at  time  t 

H(t)  = number  of  Blue  artillery  tubes  surviving  at  time  t 

G(t)  = number  of  Red  artillery  tubes  surviving  at  time  t 

J * rate  at  which  each  Red  tank  kills  Blue  tanks 

K = rate  at  which  each  Blue  tank  kills  Red  tanks 

L = expected  number  of  Blue  tanks  killed  per  Red  artillery  round 
f i red . 

M = expected  number  of  Red  tanks  killed  per  Blue  artillery  round 
f i red 

r = rate  of  Red  artillery  fire 
p = rate  of  Blue  artillery  fire 

E = expected  number  of  Blue  artillery  tubes  killed  per  Red  artillery 
round  fired 

F = expected  number  of  Red  artillery  tubes  killed  per  Blue  artillery 
round  fired 

f = fraction  of  surviving  Red  artillery  tubes  employed  against  tanks 

g = fraction  of  surviving  Blue  artillery  tubes  employed  against  tanks 

Note  that  the  last  two  factors  (f  and  g)  are  tactical  decision  parameters 
in  contrast  with  the  other  variables  that  represent  numbers  available  and 
performance  characteristics  of  various  weapon  systems.  Therefore  this 
model  does  represent  one  aspect  of  tactics  explicitly. 

10.  The  solutions  of  this  model  are  in  the  following  form: 

a.  Number  of  Blue  tanks  surviving  versus  time 

b.  Number  of  Red  tanks  surviving  versus  time 
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c.  number  of  Blue  artillery  tubes  surviving  versus  time 

d.  number  of  Red  artillery  tubes  surviving  versus  time 

These  solutions  have  been  obtained  analytically,  without  using  a computer. 

11.  This  model  has  a parallel  in  any  war  game  in  that  periodically  the 
Blue  and  Red  commanders  must  decide  (unless  it  is  programmed  to  happen 
automatically)  what  part  of  their  surviving  artillery  force  to  employ  in 
counterbattery  fire.  To  some  extent  this  decision  would  be  influenced 
by  target  acquisition,  but  this  could  also  be  included  in  the  model  in 
Figure  3 by  expanding  the  aggregated  coefficients  0,  K,  L,  M,  E,  and  F to 
include  several  types  of  target  acquisition  capability.  One  difference  is 
that  in  a war  game  the  players  have  no  way  to  find  optimum  tactics  whereas 
this  model  could  be  used  to  find  optimum  values  for  f (Red  tactic)  and  for 
g (Blue  tactic).  The  values  of  f and  g could  also  be  varied  from  time 
period  to  time  period.  Note  that  the  vulnerability  of  Red  tanks  to  Blue 
artillery  fire  is  included  in  the  factor  M and  the  vulnerability  of  Red 
artillery  to  Blue  artillery  fire  is  included  in  the  factor  F. 

12.  The  advantage  in  having  these  solutions  as  explicit  functions  of  all 
the  assumptions  and  parameters  is  that  one  can  carry  out  a wide  variety  of 
analytical  variations  before  assuming  numerical  values  for  any  of  the 
factors.  Some  of  these  variations  are: 

a.  Values  versus  time 

(1)  number  surviving 

(2)  percent  surviving 

(3)  force  ratio  of  survivors 

(4)  number  lost 

(5)  percent  lost 

(6)  loss  ratio 

b.  Time  required  for  mission  accomplishment,  e.g..  Red  losses  = 30 
percent. 


c.  Sensitivity  of  "values"  or  "times"  to  force  sizes,  tactics,  or 
other  assumptions. 

d.  Optimize  mix  of  systems 

e.  Optimize  tactics 

f.  Catalog  situations  giving  the  same  results 
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In  the  case  of  tanks  the  six  "values"  listed  under  a.  above  are  as 
follows: 


a.  Number  surviving:  B(t) 

b.  Percent  surviving:  100  B(t) 


c.  Force  ratio: 

d.  Number  lost: 

e.  Percent  lost: 


R(t)/B(t) 


f.  Loss  ratio:  L = RQ  - R(t) 

B0  - B(t) 

Bq  and  R0  are  the  numbers  of  tanks  available  at  time  o.  The  time  required 
to  kill  at  least  30  percent  of  the  Red  tanks  can  be  found  by  solving  the 
following  equation  for  time  t: 


B(t)  = 0.70  Bq 

The  sensitivity  of  the  loss  ratio  to  initial  force  sizes  can  be  determined 
from 

dL  and  dL 

dR  tUT  . 

o o 

13.  In  this  model,  tactics  can  be  optimized  for  Blue  by  finding  that 
value  of  g that  will  accomplish  one  of  the  following  goals: 


a.  Maximize  B(t)  at  a given  t (e.g.,  2 hours) 

b.  Minimize  R(t)/B(t) 

c.  Maximize  L = RQ  - R(t) 

B0  - B(t) 

d.  Minimize  time  required  to  kill  at  least  X percent  of  the  Red  tanks. 

e.  Maximize  some  weighted  average  of  Blue  tanks  surviving  and  Blue 
artillery  surviving  at  a given  time  t.  (e.g.,  maximize  a B(t)  + bH(t)). 


14.  Using  the  explicit  solutions  of  this  model  one  can  catalog  all  sit- 
uations (combinations  of  values  of  input  factors)  that  give  the  same 
results,  such  as  20  percent  of  the  Blue  tanks  lost  by  4 hours  after  the 
battle  starts.  One  example  would  be  all  combinations  of  Blue  artillery 
effectiveness  against  tanks  (M)  and  initial  number  of  Red  tanks  (R0) 
that  result  in  no  more  than  5 percent  of  Blue  tanks  lost  at  4 hours. 
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15.  The  above  paragraphs  give  specific  examples  of  the  types  of 
calculations  that  can  be  developed  from  the  type  of  model  illustrated. 

In  more  general  terms  eight  uses  of  mathematical  models  of  military 
operations  are: 

a.  Compare  alternative  systems 

b.  Compare  alternative  tactics 

c.  Optimize  tactics 

d.  Tradeoffs  between  system  parameters 

e.  Narrow  down  number  of  alternatives  to  be  gamed 

f.  Select  ranges  of  values  for  inputs 

g.  Select  likely  enemy  tactics 

h.  Sensitivity  analysis 

Both  alternative  systems  and  alternative  tactics  can  be  compared  in  a 
force-on-force  context.  Optimizing  tactics  has  been  discussed  in  the 
earlier  example.  The  fourth  type  of  use  could  be  illustrated  by  trade- 
offs between  lethality  and  rate  of  fire.  Type  e might  be  narrowing 
down  the  number  of  levels  of  close  air  support  to  use  in  actual  gaming  — 
levels  that  are  feasible  and  also  have  been  shown  by  mathematical  modelling 
to  have  significant  impact  on  the  particular  types  of  study  results  of 
interest.  Type  f could  involve  values  for  an  enemy  weapon  performance 
parameter  for  which  we  do  not  have  good  estimates.  Type  a could  be,  for 
example,  weapon-target  priorities. 

16.  For  discussing  specific  models  in  this  paper  we  have  defined  classes 
of  combat  models  according  to  the  eight  factors  in  Table  2.  Factor  3 
refers  to  variation  with  time  (or  with  weapon  range,  for  example).  We 
have  given  several  examples  earlier  of  aggregating  or  expanding  coeffi- 
cients to  represent  specific  elements  such  as  vulnerability,  rate  of  fire, 
target  acguisition,  etc.  (Factor  4).  Factor  5 concerns  allocations  of 
weapons  against  targets  and  the  replacements  in  Factor  6 are  either  per- 
sonnel or  eguipment. 

17.  Setting  factors  1 - 6 at  two  levels  each  we  have  64  classes  of 
models.  Thus,  for  example,  considering  only  tank  and  antitank  direct 
fire  weapons  would  give  64  classes.  However,  factors  7 and  8 raise  the 
number  of  classes  of  models  to  well  above  256.  For  example  in  factor  8 
we  could  have  1,  2,  3,  or  4 types  of  Blue  systems  and  1,  2,  3,  or  4 types 
Red  systems  in  the  models.  We  have  developed  at  least  50  models  in  various 
classes  with  the  taxonomy  of  Table  2.  For  factors  7 and  8 the  types  of 
weapons  of  major  interest  are  listed  in  Table  3. 
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18.  Now  we  present  a model  designed  to  study  tactical  interactions 

between  tanks  and  antitank  missiles,  in  Figure  4.  The  situation  is  as 
follows:  Blue  has  both  tanks  and  antitank  missiles;  Red  has  only  tanks; 

how  should  Red  employ  its  tanks?  The  fraction  of  Red  tank  fire  directed 
against  Blue  tanks  is  f.  What  is  the  optimum  value  of  f? 

19.  By  solving  these  three  equations  to  obtain  the  remaining  weapons  on 
both  sides  as  explicit  functions  of  time  and  of  all  the  other  parameters 
(including  the  initial  strengths)  we  can  find  out  how  the  optimum  values 
of  f depend  on  all  these  quantities  and  on  time  (duration  of  combat). 

The  solutions  are  given  in  Figures  5 and  6.  In  order  to  optimize  f. 

Red  could  try  to  maximize  the  number  of  Red  tanks  surviving  or  the  force 
ratio  surviving;or  to  minimize  the  number  of  Blue  tanks  surviving,  the 
number  of  Blue  antitank  missiles  surviving  or  some  linear  comDination  of 
these  Blue  weapons. 

20.  In  Table  4 we  list  the  major  interactions  in  the  air  defense  sup- 
pression model  presented  in  Figure  7.  Both  sides  carry  out  all  six 
operations  listed  in  Table  4 and  thus  Red  and  Blue  each  have  two 
allocation  problems: 

a.  Allocate  artillery  fire  against  air  defense  weapons  or  in 
counterbattery  fire. 

b.  Allocate  aircraft  (close  air  support)  against  tanks  or  against 
air  defense  weapons. 

Thus  complete  analysis  of  results  from  this  model  would  involve  a game 
matrix  in  which  each  side  has  at  least  9 alternatives.  Artillery  versus 
air  defense  at  high,  medium  or  low  values  times  aircraft  versus  air 
defense  at  high,  medium  or  low  values. 

21.  The  allocation  factors  in  Figure  7 are  defined  as  follows: 
fR  = fraction  of  Red  aircraft  used  against  Blue  tanks 

fg  = fraction  of  Blue  aircraft  used  against  Red  tanks 

gR  = fraction  of  Red  artillery  used  against  Blue  air  defense 

gB  = fraction  of  Blue  artillery  used  against  Red  air  defense. 

The  kill  factors  H,  J,  C and  D in  the  fifth  and  sixth  equations  could 
be  expanded  to  include  target  acquisition. 

22.  This  model  has  also  been  solved  analytically.  A typical  battle 
is  illustrated  in  Figure  8,  based  on  the  following  assumptions: 

F = .005  G = .010  fR  = .50 

R = .010  B = .010  fB  = .50 


100 


K = .010 


.50 


.50 


E = .015 

9g  = 

1 - gB  = 

.95 

H * .02 

J * .03 

1 - fR  = 

.50 

C = .01 

D = .03 

1 - fB  = 

.50 

W(Y7)  = - .04Y7 


Z (X7)  = - .04X? 


Figure  8 shows  the  Blue  and  Red  survivors  of  each  type  of  weapon  versus 
time.  After  30  minutes  the  Blue  tanks  are  down  to  less  than  50%  of 
initial  strength,  Red  has  lost  8 artillery  tubes  and  Blue  has  almost  no 
air  defense  left.  The  Red  tactic  was  to  use  half  of  its  artillery  and 
half  of  its  aircraft  against  Blue  air  defense. 

23.  In  the  above  example  it  was  assumed  that  aircraft  losses  are  directly 
proportional  to  the  density  of  air  defenses.  Some  insights  concerning  the 
conditions  under  which  this  might  be  a valid  assumption  can  be  obtained 
from  duel  models  and  force-on-force  models  of  attack  aircraft  versus 
complexes  of  air  defense  systems.  The  next  model  is  a stochastic  duel 
between  an  attack  helicopter  and  a single  air  defense  weapon,  including 
the  following  factors: 

a.  detection  capability 

b.  warning  of  being  detected 

c.  rate  of  fire 

d.  weapon  accuracy 

e.  weapon  lethality 

f.  vulnerability 
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The  capability  of  the  air  defense  weapon  to  detect  the  helicopter 
and  the  capability  of  the  helicopter  to  detect  the  air  defense  weapon 
are  both  included,  as  well  as  the  likelihood  that  the  helicopter  will 
be  aware  that  it  is  detected  (e.g.,  painted  by  radar).  Rates  of  fire 
for  both  weapons  are  included  as  well  as  accuracy  and  lethality  factors. 

24.  The  basic  inputs  for  the  model  are  listed  in  Table  5.  Using  these 
inputs  the  model  calculates,  for  the  total  encounter  of  T minutes,  the 
probability  that  the  air  defense  weapon  is  killed,  the  probability  that 
the  helicopter  is  hit  (but  not  necessarily  killed)  and  the  probability 
that  the  helicopter  is  killed.  This  third  measure  is  illustrated  in 
Figure  9,  based  on  the  inputs  defined  earlier.  Using  this  model  one 
can  study  the  sensitivity  of  helicopter  (or  air  defense  weapon)  success 
to  variations  in  a number  of  system  performance  and  tactical  parameters, 
such  as  the  following: 

flight  tactics 
bursts  fired 
rounds  per  burst 

detection  capability 
warning  equipment 
jamming 

readiness,  decisions,  response  times 

round  lethality 
burst  pattern 
rate  of  fire 
dispersion. 

25.  The  final  example  of  a combat  model  illustrates  how  models  can  be 
developed  to  represent  any  one  of  three  concepts: 

a.  changing  resource  employment  rates 

b.  diminishing  returns 

c.  increasing  or  decreasing  effectiveness  with  time. 

The  model  involves  both  Red  and  Blue  employing  support  weapons  at  rates 
that  change  with  time  (see  Figure  10).  The  support  weapons  could  be 
attack  aircraft,  mortars,  artillery,  tactical  nuclear  weapons  or  mines. 
Blue  and  Red  employment  rates  are  each  characterized  by  three  parameters. 
These  exponentially  decreasing  rates,  together  with  the  single  weapon 
effectiveness,  combine  to  kill  the  basic  elements  (e.g.,  tanks)  in 
addition  to  the  ability  of  these  basic  elements  to  kill  each  other,  as 
illustrated  in  the  upper  half  of  Figure  11.  The  Blue  tank  survivors 
versus  time,  as  shown  in  the  lower  part  of  Figure  11,  is  one  half  of  the 
solution  of  this  model.  A similar  expression  is  available  for  Red  tank 
survivors . 

26.  The  Blue  force  could  use  these  expressions  to  find  optimum  values 
for  n,  m,  and  b (representing  the  employment  tactics  for  support  weapons). 
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This  problem  is  a special  case  in  the  calculus  of  variations  --  to  find 
the  optimum  function  from  a given  class  of  functions. 

27.  We  have  developed  a facility  for  rapid  design  of  mathematical  models 
tailored  to  particular  problems.  In  Tables  6 and  7 some  of  the  design 
variables  are  reviewed.  Sensitivity  analysis  of  uncertainties  is  treated 
either  as  variations  of  key  factors  in  deterministic  models  or  as  random 
variables  in  stochastic  models.  Representation  of  optimum  tactics,  in 
terms  of  allocation  decisions,  has  been  illustrated  in  several  different 
models.  Clearly,  various  tradeoffs  between  quantity  (e.g.,  number  of 
tanks  initially  available)  and  quality  (in  terms  of  either  vulnerability 
or  killing  power)  can  be  easily  studied  with  these  models  --  finding  those 
combinations  of  quantity  and  quality  that  yield  the  same  results  (e.g., 
number  of  Red  tanks  killed  by  the  end  of  three  hours). 

28.  Using  cost  constraints  and  cost-quantity  relations  the  models  can 
be  used  to  find  optimum  mixes  and  break-even  points  --  using  combat 
measures  of  effectiveness  that  are  consistent  with  the  other  applications. 
A new  objective  will  be  to  incorporate  adaptive  tactics  into  some  of  the 
models,  so  that  the  model  can  learn  and  develop  new  tactical  decisions 
not  anticipated  during  the  formulation  of  the  model.  Comparisons  of 
equal  cost  forces  on  the  basis  of  differences  in  effectiveness  would  be 
straightforward.  In  addition,  with  these  models  we  can  set  an  effect- 
iveness goal  and  calculate  the  number  of  systems  that  are  required  (and 
their  total  cost)  for  an  alternative  force. 

29.  Using  appropriate  members  of  the  following  families  of  functions: 


a.  be  + f 

b.  bxk  —i 

r -cxk 

c.  b 1_1  - e J 

most  cases  of  diminishing  returns,  increasing  or  decreasing  effectiveness 
and  increasing  or  decreasing  resource  commitment  rates  can  be  represented, 
as  illustrated  in  the  last  model  presented.  Examples  of  increasing  or 
decreasing  effectiveness  are:  new  weapons  employed  for  the  first  time; 

increasing  hit  probability  as  range  decreases;  targets  taking  cover; 
voluntary  suppression. 

30.  Expansion  of  coefficients  was  illustrated  in  several  cases,  to  rep- 
resent system  parameters,  environmental  constraints  and  tactical  alter- 
natives. Further  model  developments  will  incorporate  more  interdependence 
between  coefficients,  such  as  artillery  fire  making  targets  less  vulner- 
able to  direct  fire  or  mines  making  targets  more  vulnerable  to  direct 
fire.  Various  non-linear  relations,  such  as  that  between  aircraft  and 
air  defense  are  candidates  for  variation.  Alternative  levels  of  aggre- 
gation could  be  applied  separately  to  the  representations  of  direct  fire, 
fire  support,  target  acquisition,  etc. 
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31.  A model  could  include  separate  equations  for  each  echelon:  company, 

battalion,  brigade.  Movement  of  forces,  depending  on  strengths,  time, 

and  terrain  could  also  be  included.  In  item  21  of  Table  7,  "lateral  units" 
refers  to  representation  of  mutual  support  (or  failure  to  support).  Future 
model  developments  will  also  include  more  explicitly  the  following  features 

a.  mission 

b.  doctrine  and  tactics 

c.  Situation 

(1)  Physical  environment 

(2)  enemy  alternatives 

d.  conditional  decision-making 

e.  system  performance 

f.  constraints 

g.  target  acquisition 

h.  supply 

i.  fire  support 

j.  weather 

k.  communications 

l.  organization 

m.  training 

32.  A number  of  approaches  for  validation  of  the  types  of  mathematical 
models  presented  in  this  paper  are  being  considered.  These  are: 

a.  specific  input  data  from  tests 

b.  high  resolution  simulations 

c.  other  mathematical  models 

d.  DIVWAG  submodels 

e.  results  of  other  analytical  studies 

f.  historical  data 

g.  hand  simulations 

h.  military' judgment 
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Table  I 


Types  of  Applications  of  Models 

1.  Effectiveness  current  forces 

2.  Compare  alternative  forces 

3.  Effectiveness  current  systems 

4.  Compare  alternative  systems 

5.  Evaluate  alternative  strategies,  tactics,  concepts, 
deployments,  doctrine 

6.  Tradeoffs  between  systems,  optimize  force  mixes 

7.  System  performance  requirements 

8.  Force  requirements jnumbers  of  systems  required 

9.  Analyze  energy  tactics  and  friendly  force  countermeasures 

10.  Impact  of  constraints  (political,  environmental,  opera- 
tional ) 

11.  Unit  performance  inputs  for  theater-level  models 

12.  Define  test  data  required  and  extrapolate  test  data 


TECHftCAl  RfPCSTS  SBCTH5K 


Measure  of  Effectiveness 


US  Alternatives 
Force  A Force  B 


Enemy  losses 


Tanks 

Personnel 


580 

T,80tT 


650 

"TOT* 


Time  required  for  enerny  mission  (hrs) 


23 


27 


US  losses: 


Tanks 


Personnel 


250 


4,200 


300 


5,800 


Tank  loss  ratio 


Enemy 

us — 


2,32 


2.17 


Figure  1.  Typical  Divwag  Results 


Assumption 

Effectiveness 
Force  A 

of  US  forces 
Force  B 

Size  of  enemy  attack 

1 . Standard 

67 

71 

2,  Larger 

45 

65 

Amount  of  US  close  air  support 

1.  Standard 

67 

71 

2.  Larger 

74 

76 

3.  Smaller 

56 

40 

Figure  2.  Standard  scenario  problem 
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Tank  loss  rates 

Artillery  loss  rates 

dB  * JR  + fLrG 
dt 

dR 

dt  « KB  + gMpH 

dH  = Er  (1-f)  G 
3t 

dG  = Fp  (1-g)  H 

(Tf 

Figure  3.  Analytical  model 


Table  2.  Classes  of  Two-sided  Combat  Models 


1. 


2. 


3. 


4. 

5. 

6. 

7. 

8. 


Deterministic  or  stochastic 
Linear  or  non-linear 
Coefficients  constant  or  variable 
Coefficients  aggregated  or  not 
Allocation  decisions:  No  or  Yes 
Types  of  weapons: 

Direct  fire  only  or  not 
Replacements:  No  or  yes 
Number  of  types  of  systems: 

Blue 

Red 
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Table  3.  Types  of  Weapons 


Direct  Fire 

Other 

Tanks 

Arti  1 lery 

Machine  Guns 

Mortars 

Infantry 

Hel  i copters 

Anti-tank  Missiles 

Ai rcraft 

Ai r Defense 

Mines 

Nuclear 

Red 

Tank  Loss  Rate 

dX  * 

BY  + CZ 

3t 

Blue  Tank  Loss  Rate 

dY 

= EfX 

at 

Blue  Anti-tank  Missile 

Loss 

Rate 

dZ 

= H (1  - f)  X 

at 

Ini! 

tial  Numbers:  Xo, 

Yo, 

Zo 

Red 

Tactic:  Fire  fraction 

f at  tanks. 

Figure  4.  Anti-tank  Weapon  Employment  Model 
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Figure  5.  Tank  and  ATM  Survivors 


Blue  Tanks  Surviving 

v (t)  = 

CH  (1-f)  Yo  - ECfZo  + 

BEf  + CH  (1-f) 

EfXo 

EBfYo  + ECfZo  cosh  ( t V BEf  + CH  (1-f)) 
BEf  + C H ( 1 - f ) 

+ 

-yBEf  + CH  (1-f) 

sinh  (t  v BEf  + CH  (1-f)  ) 

x(t) 

Force  Ratio  Surviving: 

a Y(t)  + bZ  (t) 

Figure  6.  Anti-tank  Weapon  Employment  Model 
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Table  4.  Air  Defense  Suppression  Model 


1.  Tanks 

Attack 

Tanks 

2.  Artillery 

Attack 

Artillery 

3.  Artillery 

Attack 

Air  Defense 

4,  Air  Defense 

Attack 

A1 rcraft 

5.  Aircraft 

Attack 

Tanks 

6,  Aircraft 

Attack 

Air  Defense 
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dY,  = 
dt 

- FY.  - GfR  v6 

- IW,  - BfB  Xfi 

Blue 

dx4 

dt 

■ - to  - gR)  v4 

Red 

0V4 

dt 

= - E(i-gB)  x4 

Blue 

dX7 

dt~ 

'JSE 

- - HgR  Y4  - JO-fR)  y6 

Red 

dY? 

dt 

- - CsB  X,  - 0 X6 

Blue 

dx6 

dt 

= W (y7) 

Red  dY, 


dt 


Z (X7) 


AIRCRAFT 


Table  5.  Duel  Model  Inputs 


Helicopter 

AD  Weapon 

Hit  Probability: 

Warned  target 

A 

Unwarned  target 

B 

Kill  Probability 

C 

D 

Detect  First  Probability 

E 

1~E 

Probability  Aware  of  Detection 

F 

Rate  of  Fire 

V 

R 

Duration  of  Encounter 

T 

T 
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Figure  9.  Sample  Model  Output 


Figure  10.  Support  Weapon  Employment  Rate  Model 
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_ ct 

dB_  = - KR(t)  - Mo  (P  - r)  e * Mo  r 
Blue  dt 


Red  dR  bt 

dt  = - JB(t)  - No  (n-m)  e - No  m 


Figure  11.  Loss  Rates  of  Maneuver  Elements 
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Table  6.  Flexibility  and  Variations 


1.  Sensitivity  analysis  of  uncertainties 

2.  Optimum  tactics,  employment,  allocatTom 

3.  Quantity  versus  quality 

4.  Optimum  mix:  tradeoffs;  break-even  points 

5.  Adaptive  tactics 

6.  Compare  equal  cost  forces 

7.  Increasing  effectiveness 

8.  Decreasing  effectiveness 

9.  Increasing  commitment 

10.  Decreasing  commitment 

11.  Time-varying  constraints 

12.  Diminishing  returns 

13.  Factors  within  coefficients 

14.  Interdependence  between  coefficients 

15.  Non-linear  relations 

16.  Alternative  measures  of  effectiveness 

17.  Time  period  variations 

18.  Alternative  levels  of  aggregation 

19.  Several  echelons 

20.  Movement  of  forces 

21.  Lateral  units 

22.  Model  non-combat  operations 
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HOMEOSTATIC  CRITERIA  FOR  ASSESSMENT  OF  THE  MORPHOLOGICAL  STATE 


OF  LARGE  SCALE  HYBRID  ANALYTICAL -SIMULA! ION  MODELS 


Howard  M,  Rratt 

Reliability  and  Maintainability 
Modeling  and  Analysis  Section,  Eustis  Directorate 
US  Army  Air  Mobility  Research  & Development  Laboratory 
Fort  Eustis,  Virginia  23604 


A valid  simulation  data  interval  should  represent  an  unbiased  slice 
of  time  taken  from  a continuous  process*  Evaluating  model  achievement  of 
stability  is  a process  unique  to  each  simulation  model  and  to  the  simula- 
tion experiment  under  test.  That  is,  a system  has  achieved  a homeostatic 
condition  when  the  major  parameters  by  which  the  system  performance  is 
measured  have  reached  a morphologically  unchanging  state. 

This  paper  discusses  the  length  of  the  stabilization  period 
which  should  precede  the  data  gathering  portion  of  the  simulation  run 
(Fig  1).  The  simulation  model  under  consideration  is  the  Army’s  Aircraft 
Reliability  and  Maintainability  Simulation  model,  a company  level  tactical 
operational  model  described  in  reference  1 and  reference  2, 

The  analytical  algorithms  used  in  the  stochastic  ARMS  Model  to 
establish  the  approximate  attainment  of  a morphological  state  (stability) 
are  as  follows; 

1*  Choose  an  acceptable  stochastic  parameter  and  sample  its 
magnitude  at  even  increments  of  time,  e,g.,  once  every  day, 

2.  Put  each  sample  in  a table  which  also  contains  all  prior 
samples  up  to  this  time. 

3.  As  the  ARMS  model  used  General  Purpose  Simulation  System 
(GPSS)  Language,  the  current  mean  and  standard  deviation  are  readily 
available  as  each  new  sample  is  entered  into  the  table. 

4.  Divide  the  table  standard  deviation  by  the  square  root  of 
the  current  number  of  samples  to  obtain  the  standard  error  of  the  mean. 

5.  Carry  on  a simultaneous  computation  to  obtain  a "smooth 
exponential  mean"  utilizing  a 20%  tracking  coefficient  by  summing  80%  of 
the  past  value  of  the  ’’smooth  exponential  mean”  plus  20%  of  the  latest 
sample.  (This  in  turn,  becomes  the  prior  (807o)  value  for  the  next  sample 
(20%))* 
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6.  When  the  value  of  the  smooth  exponential  mean  falls  within  a 
band  formed  by  the  table  mean  + the  standard  error  obtained  in  (4)  above, 
it  is  assumed  that  the  model  is  stabilized. 

This  tracking  algorithm  is  simple  enough  but  its  application 
requires  considerable  judgement  and  may  vary  between  experimental  scenarios* 
In  the  first  place,  it  does  not  absolutely  guarantee  the  existence  of  the 
morphological  state  for  every  stochastic  parameter  in  the  model*  Only  a 
set  of  algorithms  monitoring  every  key  stochastic  variable  could  do  that 
and  the  ARMS  model's  homeostatic  criteria  is  monitoring  only  one  stochastic 
parameter.  Obviously,  the  choice  of  the  parameter  to  be  monitored  is 
critical  and  it  has  taken  considerable  effort  to  arrive  at  the  correct 
choice , 


Some  characteristics  of  the  ARMS  model  require  explanation  in  order 
that  the  reader  understand  the  rationale  for  the  choice  of  the  stability 
criteria  parameter.  The  ARMS  model  is  close  to  being  "universal"  in  the 
sense  that  it  can  accept  almost  any  combination  of  number  and  type  of 
aircraft,  operational  utilization  flight  frequencies,  maintenance  and 
repair  policies,  manpower,  ground  support  equipment  and  logistics  supply 
systems.  The  output  statistics  are  readily  convertible  to  operational 
cost  estimates  and  mission  performance  effectiveness  assessment. 

Prior  to  start  of  a simulation  run,  each  aircraft  is  initialized 
by  assignment  of  a prior  number  of  flight  hours  and  calendar  days  to 
provide  an  approximately  equal  spacing  between  future  flight  hour  a/o 
calendar  day  inspections  (Fig*  2,  3).  Highly  critical  components  some- 
times have  a TRO  (Time  Between  Overhaul)  requirement  and  these  components 
on  each  airplane  are  also  initialized  in  relatively  equal  increments 
over  their  individual  (each  components  TBO  requirement  may  be  different) 

TBO  period.  All  of  this  initializing  is  very  helpful  in  speeding  up 
the  simulation  stabilization*  However,  when  the  simulation  starts,  no 
aircraft  has  any  required  maintenance  actions  and  all  are  setting  in 
the  availability  pool  ready  to  go.  This  is  not  normal  and  until  we  have 
a "typical"  number  of  maintenance  actions  distributed  over  the  aircraft 
company  we  should  not  start  collecting  data. 

Let  us  return  once  again  to  the  discussion  of  some  of  the  details 
of  the  ARMS  model  which  will  aid  in  the  choice  of  the  stability  parameter. 
Airplane  components  fail  in  the  ARMS  model  only  during  periods  in  which 
they  are  operating.  Discovery  of  a component  failure  at  the  time  it 
occurs  is  a probability  assigned  by  the  analyst  to  each  element.  If  not 
discovered  at  times  of  failure,  the  probability  of  it  being  discovered  at 
each  subsequent  event,  e.g.,  flight,  inspection,  etc.,  is  provided  by 
the  analyst.  Up  to  this  point  we  have  identified  THREE  parameters  of 
potential  interest. 
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1.  Failures  which  have  occurred  and  were  discovered  at  the  time 
of  occurrence, 

2,  Failures  which  have  occurred  and  have  not  yet  been  discovered 

and , 

3*  Failures  which  have  occurred  and  were  not  discovered  at  time 
of  occurrence  but  have  subsequently  been  discovered. 

Because  of  another  feature  of  ARMS  model  policy,  none  of  the  above  para- 
meters can  be  used  in  their  present  form  for  our  criteria.  This  feature 
is  the  categorizing  of  component  failures  into  "Hard  Down"  (grounding 
the  aircraft  until  repaired)  and  "Deferred  Maintenance  Actions"  (fix  as 
soon  as  possible  but  fly  all  required  missions  in  the  meantime) * In 
addition,  the  ARMS  analyst  identifies  the  maximum  number  of  "Deferred 
Maintenance  Actions"  he  will  permit  on  any  one  aircraft  and  any  deferred 
maintenance  actions  in  excess  of  this  number  will  also  send  the  aircraft 
"Hard  Down."  Because  of  this  variability  in  the  number  of  allowable 
deferred  maintenance  actions  per  aircraft  in  different  simulation  experi- 
ments, the  daily  total  of  deferred  maintenance  actions  is  not  a good 
indication  of  stability,  However,  because  both  hard  down  maintenance 
actions  and  deferred  maintenance  actions  eventually  result  in  the  assign- 
ment of  maintenance  actions  to  be  worked  off  on  the  aircraft,  this 
parameter  "the  number  of  maintenance  actions  assigned  to  be  worked  off 
in  any  24  hour  period"  becomes  the  selected  stability  criteria  parameter. 

By  normalizing  all  stability  computations  to  a constant  number  of  aircraft 
(we  chose  30)  base,  our  stability  criteria  becomes  independent  of  the 
number  of  aircraft  in  any  experiment  and  our  selected  criteria  for  achieve- 
ment of  stability  (although  not  the  time  required  to  achieve  it)  is  inde- 
pendent of  the  aircraft  utilization  rates.  We  also  conducted  other  investi- 
gations  relative  to  the  best  value  to  use  for  the  tracking  coefficient 
prior  to  our  selection  of  80%  of  the  last  value  plus  2Q7o  of  the  new  sample. 

When  the  stability  criteria  has  been  satisfied,  the  model  auto- 
matically transitions  from  the  stability  period  to  the  data  gathering 
period  on  midnight  of  the  following  Sunday.  At  this  time,  many  of  the 
data  values  accumulated  during  the  stability  period  are  re-initialized 
(zeroed  out)  while  other  parameters  remain  unchanged  (Fig,  4), 

Again,  there  is  no  guarantee  of  stability  with  the  selected  cri- 
teria and  the  analyst  must  use  reasonable  judgement  in  the  evaluation  of 
the  stability  requirements  of  any  given  experiment.  *We  have  found  that 
in  a reasonably  active  simulation,  this  criteria  will  cause  the  model  to 
run  3-4  weeks  to  attain  stability.  The  length  of  the  data  run,  after 
stabilization,  is  whatever  was  chosen  by  the  systems  analyst. 
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THE  APPLICATION  OF  INFINITESIMAL  TRANSFORMATION  GROUPS 
TO  THE  SOLUTION  OF  NONLINEAR  PARTIAL  DIFFERENTIAL  EQUATIONS. 


George  W*  Ullrich 


US  Army  Mobility  Equipment  Research  and  Development  Center 

Fort  Belvoir,  Virginia  22060 


ABSTRACT.  The  fundamental  aspects  of  the  method  of  infinitesimal 
transformation  groups  toward  determining  the  global  invariance  group 
of  partial  differential  equations  are  reviewed.  The  primary  discussion 
rests  upon  the  application  of  this  method  to  a system  of  two  nonlinear 
partial  differential  equations  describing  a biological  diffusional  trans- 
port process.  With  the  full  invariance  group  determined,  we  proceed 
to  find  invariant  solutions  for  those  equations  under  independent  one- 
parameter  subgroups.  For  this  manifold  of  self- similar  solutions  the 
governing  partial  differential  equations  reduce  to  ordinary  differential 
equations  which  involve  only  the  similarity  variables.  The  latter 
equations  are  solved  explicitly  for  those  cases  which  yield  one- 
dimensional traveling  wave  solutions, 

INTRODUCTION.  In  order  to  solve  the  nonlinear  partial  differential 
equations  encountered  in  nature,  the  mathematician  must  choose  prudently 
from  among  a variety  of  ad-hoc  analytical  methods  of  attack.  While 
there  is  no  general  extant  theory  for  nonlinear  PDEs,  some  powerful  and 
far-reaching  methods  of  analysis  have  been  advanced.  In  this  discussion 
we  present  a group-theoretic  method  that  may  be  employed  to  systematically 
generate  self- similar  solutions  for  both  linear  and  nonlinear  PDEs. 

BACKGROUND.  Consider  the  set  of  one-to-one  continuous  transformation 
which  map  the  x-y  space  into  itself, 

x ' = x' (x,y;e) 
y'  = y' (x,y;e) 


(1) 


TECHNICAL  REPORTS  SECT Htt  1 
ST5NF0  BRANCH 
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where  e denotes  an  arbitrary  parameter.  The  transformations  (1) 
comprise  a group  provided  that: 

1)  Successive  transformations  satisfy  a closure  property 

2)  Each  transformation  has  an  inverse  which  is  a member  of  the  group 

3)  There  exists  an  identity  element  e so  that 

x'(x,y;e)  = x , y'(x,y;e!  = y • 

Expanding  the  right-hand-side  of  Eqs.  (1)  about  the  identity  transformation, 
denoted  by  e = e , we  obtain 

x'  = x + (e-£)X(x,y)  + 0(e-§)2 

y ' = y + (e-£)Y(x,y)  + 0(e~£)2  ^ 

Eqs.  (2)  constitute  a group  of  infinitesimal  transformations  in  the 
neighborhood  for  the  identity  transformation,  i.e,  , they  satisfy  the 
aforementioned  group  properties  to  within  o(e-£). 

Almost  100  years  ago  Lie  (1881)  began  to  develop  his  theory  of 
one-parameter  groups.  And  although  he  discussed  the  application  of 
infinitesimal  transformation  groups  to  the  solution  of  PDEs,  subsequent 
work  followed  mostly  along  the  path  of  finite  groups.  Most  notable  is 
the  work  of  Dickson  (1924)  and  Cohen  (1931)  who  demonstrated  the  utility 
of  group  theory  for  integrating  ordinary  as  well  as  partial  differential 
equations.  Recognizing  the  relationship  between  dimensional  analysis 
and  group  theory,  Birkhoff  (1950)  proposed  a systematic  analytical  pro- 
cedure for  reducing  the  number  of  independent  variables  in  a system  of 
PDEs,  thus  facilitating  integration.  Morgan  (1952)  provided  a generali- 
zation of  this  method  with  rigorous  proof  of  its  validity.  In  essence, 
the  Morgan  theory  states  that  any  system  of  PDEs,  each  of  whose  member 
equations  is  conformally  invariant  under  a one-parameter  group  of 
transformation,  may  be  reduced  to  a simpler  system  involving  one  less 
independent  variable.  This  so  called  similarity  reduction  is  effected 
by  transforming  the  original  n dependent  and  m independent  variables  to 
the  set  of  n dependent  and  m-1  independent  similarity  variables  which 
are  the  functionally  independent  group  invariants.  To  each  one- 
parameter  group  there  corresponds  a self-similar  (invariant)  solution 
which  is  obtained  by  solving  the  simpler  reduced  system. 
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For  example,  consider  the  steady-state  heat  conduction  in  a hollow 
right  circular  cylinder.  The  temperature  distribution  is  governed  by 
Laplace's  equation 


T + T = 0. 
xx  yy 


(3) 


It  is  straightforward  to  show  that  Eq.  (3)  is  conformally  invariant 

(i.e.,  T^,x,  + Ty , y , = o^x.y.eJtT^  + Tyy]  ) under  the  rotation  group 


x'  = xcosO  + ysinfi 
y'  = xsinQ  + ycos0 

T'  = T 


(4) 


where  0 is  the  group  parameter.  Thus,  the  Morgan  theory  predicts  a 
reduction  in  the  PDE  (3)  to  an  ODE  by  introducing  the  similarity  variables 


R = (x2+y2)1/2 
T = T(R) 

which  are  the  invariants  of  the  group  (4).  Eq.  (3)  then  becomes 


d2T  1 dT  _ 
dR2"  R dR 


(6) 


One  should  note  that  the  Morgan  theory  does  not  supply  an  effective 
method  for  determining  all  of  the  admissible  one-parameter  groups  of 
transformations.  A further  deficiency  is  the  lack  of  a formal  mathematical 
procedure  for  establishing  the  group  invariants.  Consequently,  in  a 
typical  application  of  the  theory,  simple  one-parameter  groups  of  trans- 
formation are  assumed  so  that  the  invariants  may  be  determined  by 
inspection,  as  demonstrated  above. 

The  latter  shortcomings  are  eliminated  by  the  use  of  infinitesimal 
groups  as  first  shown  by  Ovsjannikov  (1969),  With  this  method  all 
similarity  variables  and  the  functional  forms  of  the  concomitant  self- 
similar solutions  may  be  derived  directly  from  the  infinitesimal  version 
of  the  full  invariance  group  of  transformations.  Further  applications 
and  ramifications  of  the  method  are  given  by  Bluman  and  Cole  (1969), 
Woodard  (1971),  Ames  (1972),  Rosen  and  Ullrich  (1973),  and  Ullrich  (1974). 
In  the  following  analysis  we  elucidate  the  fundamental  aspects  of  the 
infinitesimal  similarity  method.  However,  the  discussion  will  primarily 
focus  on  its  application  to  a specific  system  of  PDEs. 
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SIMILARITY  ANALYSIS  BY  INFINITESIMAL  TRANSFORMATION  GROUPS  . 


Consider  the  second-order  system  of  nonlinear  PDFs, 


39 (x,t ) _ 1-n  5^(x,t ) (7) 

3t  X 3x 

= l-n06  3_/0-6  30v  (8) 

at  x at1  ax;  * 


For  the  case  of  one-dimensional  symmetry  (n=l)  and  the  identification, 


a _ / s d-m 

0 | (— ) , 0 < m < 1 

7—  = -(1-m  ;ks  b 

dt  oo 

(9) 

(10) 

3 = 6/(l-m)y 

(ID 

Eqs.  (7)  and  (8)  become  (Ullrich,  1974) 

3b  _ 32b  .9,3  . 

3t  ‘ P 3^  - 4 3?(b  3xtas) 

(12) 

If  * -*■»  • 

(13) 

Letting  b (x,t)  and  s (x,t)  denote  bacterial  concentration  and  substrate 
agent  concentration,  respectively,  and  considering  u/S  and  k as  constant 
parameters,  Eqs.  (12)  and  (13)  comprise  a model  for  a biological  diffusional 
transport  process  known  as  bacterial  chemotaxis*  (Keller  and  Segel,  1971  ). 
In  particular,  the  latter  system  of  equations  describes  the  propagation  of 

plane  - symmetric  bands  of  bacteria  in  a capillary  containing  a single  sub- 
strate chemotactic  agent  that  is  consumed  in  the  process.  An  extension 
of  these  equations  to  cases  of  two-  and  three-dimensional  symmetry  is 
given  by  Eqs.  (7)  and  (8)  with  n=2  and  n™3,  respectively. 


^Chemotaxis  is  the  apparently  purposeful  response  of  some  species 
of  motile  bacteria  when  subjected  to  gradients  of  a critical  chemical 
in  the  substrate  medium.  The  response  is  characterized  by  a spatial 
ordering  (swarming)  of  bacteria  into  distinct  bands  (1-D)  or  rings 
(2-D)  which  move  preferentially  toward  higher  or  lower  concentrations 
of  the  substrate  agent. 
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To  recapitulate,  the  manifold  of  self-similar  solutions  to  a system  of 
PDEs  is  uniquely  determined  by  the  invariant  solutions  of  those  equations 
under  appropriate  one-parameter  groups  of  transformations.  Accordingly, 
we  require  Eqs.  (7)  and  (8)  to  be  simultaneously  invariant  under  the 
generic  one-parameter  group  of  transformations 
x'  = x' (x,t,u,v;e) 

t'  = t ' (x,t ,u,v;e ) 


u'  = u' (x,t,u,v;e)  ^ 

v'  = v' (x,t,u,v;e) 

where  u and  v are  the  generic  dependent  variables  ( u = 0(x.t) 

and  v = Mx,t)  being  unique  solution  surfaces^  , and  e is  the 

group  parameter. 

In  addition,  we  require  the  invariance  of  all  boundary  conditions 
fb(u,x,t)  = 0 & C.(v,x,t)  = 0 given  on  the  curves 

3.(x,t)  = 0 ana  -yu(x,t)  = 0 that  comprise  the  boundaries 

of  theJregions  of  definition  of  the  unique  solution  surfaces  u = 0(x,t) 

and  v = t)  , respectively.  Note  that  the  group  of  transforma- 

tions (14)  features  complete  mixing  of  the  independent  and  dependent 
variables,  thus  defining  a one-to-one  mapping  of  the  u,  v,  x,  t space 
into  itself. 


According  to  the  stipulated  invariance  conditions,  the  unique  solution 
surfaces  (associated  with  appropriately  prescribed  boundary  conditions) 
must  map  into  themselves,  i.e.,  the  solution  surfaces  must  be  invariant. 
Mathematically  this  is  stated  as 


u'  (x,t,0(x,t)  ,\|>(x,t  );e)  = e(x',t') 

v*  (x,t,0(x,t)  ,4i(x,t);e)  = b(x',t')  . 


In  principle,  Eqs.  (15)  are  sufficient  to  determine  the  admissible  functional 
forms  of  6 and  ^ , 

We  now  make  a decisive  simplification,  a la  Lie,  by  studying  the 
invariance  in  the  neighborhood  of  the  identity  transformation.  Consider  the 
infinitesimal  version  of  the  group  of  transformations  (14), 

x'  = x + eX(x,t,u,v)  + 0(e2) 

t'  = t + eT(x,t,u,v)  + 0(e2) 

u'  = u + eU(x,t  jU,v ) + 0(e2) 

v'  = v + eV(x,t,u,v)  + 0(e2) 


where  e=0  corresponds  to  the  identity  transformation.  It  can  be  shown 
(see  e.g.  Cohen,  1931)  that  invariance  of  a function  f(x,y)  under  a 
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one-parameter,  infinitesimal  group  of  transformations  is  a necessary  and 
sufficient  condition  for  invariance  of  the  function  under  the  corresponding 
finite  group.  The  advantage  of  using  the  infinitesimal  approach  in  our 
analysis  is  that  the  invariance  condition  in  the  neighborhood  of  the 
identity  implies  linear  (and  hence  tractable)  equations  for  X,  T,  U,  and  V. 

Using  Eqs.  (15)  and  (17)  we  proceed  to  write  the  infinitesimal  versions 
of  the  invariance  condition  for  the  solution  surfaces, 

0(x+eX,t+eT)  = 0(x,t)  + eU(x,t,u,v)  + 0(e2) 

. o \ 

t|i(x+eX.t-teT)  = i|/(x,t)  + ev(x,t,u,v)  + 0(e  ) 

Expanding  the  left  hand  side  of  Eq.  (18)  in  a Maclaurin  series  in  e and 
equating  terms  of  0(e) we  find, 

X(x,t,0,4')0x  + T(x,t  ,0  = U(x,t,e  ,!(/) 

X(x,t,0,^)it  + T(x,t  ,0  = V(x,t,0  ,4>)  , (19) 

X Xr 

the  equations  describing  the  invariant  surfaces. 

The  latter  Eqs.  (19)  are  first-order  and  quasi -linear  with  the  associated 
Lagrange  characteristic  equations 


dx 

dt 

d9 

X 

T 

U 

dx 

dt 

(20) 

X T V * 

We  can  find  an  integrating  factor  for  the  left  equality  of  Eqs.  (20)  if  X/T 
is  independent  of  0 and  $ . The  resulting  integral 

n(x,t)  = const.  (21) 

defines  similarity  curves  in  x,  t space.  From  the  right  equalities  of 
Eqs,  (20)  we  then  obtain 

u(x,t)  = F1(x,t,Ti,f1(n)) 

v(x.t)  = F2(x,t,ruf2(n))  (22) 

wheref1(n)and  f^Cn)  are  the  solutions  of  the  simultaneous  ODEs  obtained  by 
substituting  Eq.  (22)  into  Eqs.  (7)  and  (8).  Hence,  we  have  shown  that 
the  functional  form  of  the  similarity  solution  given  by  (22)  can  be  derived 
directly  from  the  infinitesimal  version  of  the  full  invariance  group. 
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We  proceed  to  determine  the  admissible  forms  for  the  quantities  X,  T, 

U,  V,  i.e. , those  infinitesimal  functions  for  which  ©'(x'jt')  and 

,t ' ) are  solutions  to  equations  (7)  and  (8)  with  all  the 

variables  primed.  This  calculation  requires  expressions  for  the  partial 
derivative  terms  appearincr  in  equations  (7)  and  (8)  with  the  variables 
primed.  Since  u = 9(x,t)  and  v = t(x,t)  are  the  solution  surfaces, 
it  is  expedient  that  all  partial  derivatives  be  calculated  with  respect  to  those 
surfaces  the  transformed  space  and  time  variables  then  being  given  by 
functions  of  the  form 

x'  = x'(x,t) 

t'  = t'(x,t)  , (23) 


From  Eqs.  (17)  and  (23)  it  follows  that 


- 1 - e[X  + X 0 + X ] + 0(e2) 

9x'  x u x vrxJ 

3x 

at' 
at 
at 

9t_ 

9x4 


- ■*-  xuet  + xv*t]  + o(«2) 

, = 1 - e[Tt  + Tu0t  + Tvi|>t]  + 0(e2) 

- e[T  + T 6 + T \p  ] + 0(e2)  . 

L X U X V X 


(24) 


Using  (24),  we  can  now  calculate  the  transformation  equations  that  relate 
the  various  partial  derivative  terms.  We  first  consider  the  transformation 
equations  for  u and  v given  by  (17)  along  the  solution  surfaces 
9 and  and  write 


e'U'.t')  = e(x,t)  + eU(x,t,e,40  + oU2) 

<r(x',t')  = li-(x.t)  + ev(x,t,d,ip)  + 0(e2)  . 

Hence  it  follows  that 

0t  + £{ut  + (uu-Tt)et  - xtex  - V2 

- Wx  + Vt " VA  ' VtV  + 0(e2)  • 


96  1 
9t ' 


(25) 


(26) 
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and 


9V 
9t ' 


*t + E{¥t + (W*t  - XA  - Vt  - W> + Vt 


- X i|i  0,  - T ip . 0 . } + 0(e2) 
u x t u t t 


(27) 


'i'he  spatial  derivative  terms  90 '/9x'  and  9ifi'/9x'  are  formed  from  Eqs.  (26)  and 
(27)  by  interchanging  the  roles  of  t and  x and  T and  X.  Finally,  the  required 
mixed  partial  derivative  term  follows  directly  from  (26) 


929 


3xf  9t 1 


0 , + £{u  , + (U  -T  )e . + (U  -X  )0  - T 02 

xt  xt  XU  Xt  t ut  xt  X XU  t 


x ,e2 

ut  X 


T 02ii  + (u  - T 
vu  trx 


uu 


-L-  ^ V I ^ ^ 1 

ut  XU  X t 


)e  ex  - x e2e 


uu  x t 


- X 0 V 0 

vu  t X X 


t e20  - x^e  - t 0. , + (u  - x - T.  )e  . 

UU  t X t XX  X tt  U X t xt 


-■  X 6 0,  - T 0 0 - 2X  0 J - 2T  0 X0X  + U 

U ' XX  t U tt  X u xt  X u xt  t XV  t 

+ U - X * 0 - t he,  + (u  - x u 0 

vt  X vtTx  X XVTt  t UV  • XV  t X 


(28) 


+ (U  - T ) 0 , ^ 

VU  Vt  t X 


X ip.ip  0 
TT  t X X 


T \p.ip  0 , + U ip., 
VT  t X t VrXt 


- X 0 ip,  - T 0,  - T 6 J - T e J - X 6 1(1  , 

V XX  t V tt  X V xt  t V t xt  V X xt 

- X ip  0 . - X 02iK  - T ^.0,9  + U ipti  } + 0(e2)  . 

V X xt  UV  X t uvrt  t X VV  t X 


Rewriting  Eqs . (7)  and  (8)  in  terms  of  the  primed  variables  and  substituting 
the  latter  transformation  formulas  for  the  derivative  terms,  we  obtain 


99  1 _ /.  il-n  9T 

at T 1 ; 9x ' 


90 

9t 


X 


1-n  3jp_ 
9x 


+ 


eQ1  + 0(e2) 


and 


(29) 


/ 929  1 

9x'  9t ' 


0(0') 


-1  36J_ 

3x' 


96  ' 
9x ' 


dtp  n-1  / 3 20 
9t  “ X 9x9t 


R0-1  IQ  11) 
60  9l  9t] 


+ eQ0  + 0(e2) 


(30) 
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where 


Q-i 


U + (U  - T, )0. 

t U t t 

" Tv^tet  " (l-n)x 


X40  -T0?-X0O  + U i - X 0 i 

t X U t U t X V t V X t 


-n 


Xip 


1-n 


[U  + (V  - X )\p  - T iJj 

. X V X X X t 


— X ^ 2 - T ip  ip , + V 0 

vYx  V X t U X 


T \p.d  - X \p  0 ] 

U t X U X X 


(31) 


and 

Q, 


= V, 


- Wt  - <"-i>*n'2xext 


■ x ip  0, 

U t U X t 


+ (V  - T,  )ip.  - x^ip  - T ip?  _ x i/m p + V 0 
v t t t x v’t  v t x u 

- xn-1[U  ,+  (U  - T O0,  + (U 

vt.  vm  vt.  +.  i 

T 


x )e 

xt-  X 


- T 0^  - X J2  + (U 
XU  t ut  x uu 


"ut 


X Q - T 0 , . + (U  -X  - T ) 0 , 

t XX  X tt  U X t xt 


)0  0 , 

- X i 

X t 

uu 

- X 0 

0 

u 

XX  t 

U ip 

+ u _■ 

xvrt 

vt 

t e42e 

UU  t X 


(32) 


- T 9i4.0  - 2X  9 , 0 - 2T  0 .0,  + U iJj  + U ,i/>  - X .tp  Q 

U tt  X u xt  X u xt  t XV  t vt  X VtT- 

- T *.8.  + (U  - X )^.  0 + (U  - T . ,dtip  - X iji.iji  8 

xvrt  t UV  XV  t x vu  vt ) trx  VV  t X X 

- T i*  0 + U 1|)  , - X 0 ^ - T 0.^  - T 0 ip  - T Q.ip  , 

VV t X t V xt  V XX  t V tt  X v xt  t v t xt 

-xe</<-xi|>0-x  92ik  - t i/i.e.e  + u - t 0?^ 

U X Xt  V X xt  UV  X t UV  t t X VV  t X vu  t 


+ X 


( d ip  e ] + (n-i  )xn  2@x0‘ 
xu  trx  xJ 

(u  - X )0  + U ip  -T9 

U XX  Vrx  X t 

n-1$e-1e  [u,  + (u  - t ) f 

X t u t 


1 

n-l„„ 

0 

e,  - 

X t 

x 30 

_ 

X 92  - 

T 0 0 

U X 

u t 

- X 0 

r3 

CD 

t 

t X 

U 

VU  t X 


u s 

X t 

r e,  * - 

v trx 

X 0 0 
u t x 


V"x 


vTx  X' 


v4 1 


X 0 T/i.  - T *.0,1  , 

V X t V t t 


Next,  we  ensure  the  invariance  of  the  solution  surfaces  by  requiring  that 
the  quantities  and.  in  Eqs . (29)  and  (30)  be  identically  equal  to  zero.. 
This  so  called  "classical  method11  results  in  a system  of  linear  partial 
differential  equations  that  must  be  satisfied  by  the  X,  T,  and  V. 

Since  both  0 and  * can  be  prescribed  arbitrarily  at  an  Initial  instant 
of  time,  we  must  equate  to  zero  the  coefficients  Of  the  functionally 
independent  terms  in  d,ip  and  their  derivatives  in  = 0 and  = 0* 

Now  0 and  ip  are  specified  by  the  right-hand  side  of  Eqs.  (7)  and  (8)  and 

Tj  l* 
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must  be  eliminated  by  substition.  the  homogeneous  coeffecient  equations 
that  ensue  are  linear  in  X,  T,  U,  and  V and  can  be  satisfied  by 

X = c^x  + ; Cg(n-l)  = 0 

T » 2c  t + c 

1 5 (33) 

u = C^e 

V = [c^  + (n-2)c1J^. 

These  latter  equations  comprise  the  infinitesimal  version  (Eqs,  ( IT ) ) of  the 
full  invariance  group  of  the  system  of  equations  (7)  &nd  (8)*  That  is, 

x?  = x + c^ex  + c^e  + 0(e2)  ; c^(n-l) 

tr  = t + 2c1et  + c^e  + 0(e2) 

0’  = 0 + c^e0  + 0(e2) 

ip1  = \p  + [c^  + (n-l)c^]e^  + 0(e2) 

where  c , c , c~,  and  Ci  are  arbitrary  constants. 

The  corresponding  finite  group  of  transformations  (which  for  this  simple 
case  may  be  written  by  inspection)  may  be  derived  formally  by  using  the 
Lie  group  generators.  The  result  is  stated  with  a convenient  redefinition 
of  the  translation  parameters  as 

c 

x'  = eax  + k ; k(ii-1)  = 0 ; k e — (eCl£  - l) 

C1 

Q 

t'  = e2at  + A ; a = c^e  , . A = (e2cle  - l)  (35) 

6 ’ - e^0  ; y = c^e 

r » e[Y  + (n-2)a]^  . 


= 0 

(34) 


Due  to  the  arbitrariness  of  the  constants  c^  , c^5  c , and  c^,  the 
latter  transformations  comprise  a 4-parameter  group,  displaying  transla- 
tinal  invariance  in  t,  translational  invariance  in  x (restricted  to  the 
one-dimensional  version  of  the  governing  equations),  similitudinous 
scale  invariance  in  t and  x,  and  scale  invariance  in  the  dependent 
variables  0 and  t[>.  Clearly,  the  transformations  (34)  have  *>4  one- 
parameter  subgroups.  Using  the  real  4-tuple  c = (c_,  , c , c„,  c,  ) as  a 

j j /.j-L 

label,  each  one-parameter  subgroup  is  designated  by  a fixed  c with  the 
common  subgroup  parameter  e ranging  over  the  entire  real  number 
scale. 

We  proceed  to  determine  the  classes  of  one-parameter  subgroups  that 
have  qualitatively  different  group-space  trajectories  and  consequently 
lead  to  functionally  independent  self-similar  solutions  to  the  original 
system  nonlinear  PDEs , 
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It  is  convenient  at  this  point  to  combine  the  original  second-order 
system  of  equations  (Eqs.  (7)  and  (8)  ) yielding  the  single  third-order 
nonlinear  PDE 


1-n 


yx 


n i03 


-g  38, 

3xJ 


(36) 


with  the  associated  invariance  group  of  infinitesimal  transformations 

•2' 

'1““  w2" 


x’  = x + C-,ex  + c_e  + 0(e2) 


t'  = 


t + + c0e 


+ 0(e2) 


0 ' - 0 + c^e0  + 0(e2) 


(37) 


For  the  class  of  one -parameter  subgroups  designated  by  C|  =/  0,  the 
Lagrange  characteristic  equation  (20)  becomes 


dx 

C1X  + C2 


dt 

2c  xt 


+ c. 


de 

c4e 


Integrating,  we  obtain  the  subgroup  invariants 

(x  t c^^)2 

n " * + C3/2C1  " 

9 

(t  7 c3/c1)‘V2c1  <38) 


which  are,  in  fact,  the  integration  constants.  Thus,  by  Eq.  (22),  the 
corresponding  general  subgroup  invariant  equation  for  0 , i.  e.  , the 
self- similar  solution,  takes  the  form  f = f(r))  or  equivalently. 


9 = (t  + c3/Cl)c4/ci  f(n) 


(39) 


The  second  class  of  one -parameter  subgroups  are  designated  by  c-^  = 0, 
and  lead  to  the  Lagrange  characteristic  equation 


dx  = dt  d0_ 

C2  " C3  C46 
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(40) 


With  the  further  restriction  of  ^ 0,  / 0,  we  integrate  the  left  and 

right  equations  in  (40)  and  again  obtain  two  integration  constants  which 
are  the  functionally  independent  subgroup  invariants 

n = x - (c2/c3)t 

f = exp[-(cit/c  ) t ] 9 . (41) 

The  functional  form  of  the  corresponding  self-similar  solution  for  0 is 
then  given  by 


0 = e(cl/c3)t  f(T1) 


(42) 


Eqs . (39)  and  (42)  comprise  the  manifold  of  self-similar  solutions  to 
Eq.  (36).  By  substituting  the  latter  expressions  into  Eq.  (3  6)  we  obtain 
an  ordinary  differential  equation  for  f(n)in  each  case.  In  the  following 
paragraphs  we  restrict  our  attention  to  the  wave  solutions  (42)  which  are 
of  more  immediate  physical  interest  in  view  of  the  existing  experimental 
data.  Specific  solutions  of  the  type  (39)  are  given  elsewhere  (Ullrich, 

1974)'. 

Defining  c 2/c3  = c as  the  constant  propagation  speed,  Eg.  (42)  describes 
steadily  propagating  waves  that  are  uniform  ( cl/c3  0 ) or 
exhibit  exponential  decay  ( c^/c^  < 0 ) orgrowth  ( ci/c3  > 0 '* 

Since  we  require  C2  / 0,  the  condition  (n-l)c2  = 0 in  (34)  restricts 

the  latter  wave  solutions  to  the  special  one-dimensional,  plane-symmetric 
case  of  the  governing  dynamical  equation  (36).  Furthermore,  it  is  note- 
worthy that  accelerative  wave  solutions  which  exhibit  a time-dependent 
propagation  velocity  are  precluded  by  Eq.  (42).  This  is  directly 
attributable  to  the  fact  that  the  invariance  group  (34)  does  not  admit 
independent  scale  transformations  in  x and  t. 

We  will  consider  two  cases  of  Eq.  (42)  denoted  by  = 0 and  ^ 0. 

For  the  former  case,  Eq.  (42)  reduces  to 

e = f(n)  , n = x-ct, 

which  when  substituted  into  Eq.  (36)  yield  the  ordinary  differential  equation 


ycf ' 


y 6cf 1 ' 

f7'- 


2ygcf 1 'f ' 


= 0 


(43) 
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Supplemented  with  the  boundary  conditions* 

f 0 , n - 05 

f 0 , n">+o° 

f'  ->  o , n ->  + « 

appropriate  for  the  Adler  band  experiments  (Adler,1969) , the  solution  to 
equation  (43)  is  given  by 


e = f(n)  = (1  + e-=n/W)-l/(S-l)  _ B >x 


(44) 


With  the  definitions  (9)  through  (11),  equation  (44)  begets  the  explicit 
expressions  for  the  bacterial  density  and  substrate  concentration 
distributions 


b 

s 


2 1-m 

k[S  - *l-m)ii]  e n/U(l  + e"cn//y)“6//f6  ' d-m)u] 
s (1  + e_cr>/y  s-u/ 1 <S  - (l-m)y  ] 

nr,'  * 


(45) 

(46) 


where  [l  - 6/y]  < m < 1*  (47) 

Eqs.  (45)  and  (46)  have  been  shown  to  be  in  good  agreement  with  experi- 
mental data  (Keller  and  Segel,  1971;  Rosen,  1974). 

For  the  case  jL  o,  we  consider  the  general  form  of  the  self-similar 
solution  (42), 


0 = e^f(ri)  , H = x - ct 


*Similarity  reductions  generated  by  the  Morgan  theory  become  physically 
useful  only  if  the  similarity  solutions  are  compatible  with  the  original 
auxiliary  conditions.  This  compatibility  requirement  can  be  tested  by 
checking  a posteriori  whether  the  auxiliary  conditions  are  expressible, 
without  inconsistency,  in  terms  of  the  similarity  variables.  This  will 
not  always  be  the  case.  For  example,  the  similarity  reduction  of  a PDE 
in  three  variables  with  two  boundary  conditions  and  one  initial  condition 
requires  that  the  original  auxiliary  conditions  coalesce  to  form  two 
boundary  conditions  on  the  independent  similarity  variable. 
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where  P 5 ci/c3  ‘ Unfortunately,  this  functional  form 
prevents  satisfaction  of  a boundary  condition  germane  to  the  one- 
dimensional, plane- symmetric,  bacterial-band  propagation  experiments 
(see  Adler,  1969),  viz.  , 


Nonetheless,  traveling  wave  solutions  that  exhibit  exponential  growth 
are  intuitively  physical  for  the  model  in  question.  In  fact  the  classic 
example  of  exponential  biological  growth  is  bacterial  growth  in  an 
unlimited  culture  medium  where  overcrowding  effects  are  precluded. 
Thus  we  propose  the  alternate  boundary  conditions 

f(n)  H 0 > n -»  - 00  s 

, . ; t = 0 

f(n)  * o , n + 05 


which  are  suggestive  of  an  experiment  wherein  a bacterial  band  originates 
at  each  end  of  a long  capillary  so  that  both  bands  propagate  toward  each 
other,  depleting  all  of  the  substrate  chemotactic  agent  behind  them. 

(Such  an  experiment  has  not  been  performed  to  date).  The  ODE  forf’(b)  that 
results  is  given  by 


ycf"'  + [c2  + y( $-l)p]f ' ' - £cpf ' + P2f 


2ygcf 1 1 f 1 


uScf ,3 


(48) 


Eq.  (48)  may  be  reduced  to  the  first-order  ODE 

u*  -a.  ((3  - 28) 

- {(l-g)z  + - Q-  + -p^lu3  = 0 

v ' yc  yz  ycz^ 

by  successively  imposing  the  variable  transformations 


y(f) 

= f'(n) 

z(f) 

2 y/f 

q(z) 

= fz' 

u(z) 

-l 

= a 

(49) 
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Eq.  (49)  is  an  Abel  equation  of  the  first  kind  which  for  small  values 
of  p (large  g ) has  as  its  formal  solution 


y(a+l)(3-26) 

2a 

rz 


+ 


[(i-e)z3  - 


1*0 

rY(a+l)(3-2g)z2 
L 2a 


+ fl3^2sT  + Yuiaau 

+ ( 3-2gzuQ]  X 

+ YUoJaai  = const- 


(50) 


where  y is  an  arbitrary  constant,  a is  a constant  whose  admissible 
values  are  prescribed  by 


a 


± 


-l  ± A5-U 

2 


_ ( 3—23 ) 2 

~ 2(1-6) 


and  u and  z are  constants  that  may  be  chosen  as  is  convenient. 

CONCLUSION.  In  conclusion,  we  have  demonstrated  the  utility  of 
infinitesimal  transformation  groups  toward  systematically  arriving  at  the 
manifold  of  self-similar  solutions  for  PDEs.  The  method  is  particularly 
suitable  for  nonlinear  PDEs  because  the  group-theoretic  approach  entails 
no  implicit  assumptions  of  linearity.  On  the  other  hand,  the  richness  of 
the  manifold  of  self-similar  solutions  is  dependent  on  the  degree  of 
symmetry  exhibited  by  the  governing  system  of  equations.  When  higher 
order  nonlinearities  are  present,  the  probability  of  uncovering  subtle 
symmetries  decreases  substantially.  In  the  final  analysis,  the  usefulness 
of  the  approach  advanced  in  this  paper  must  be  weighed  intuitively  for  each 
application. 


139 


REFERENCES 


Adler,  J.  (]  969).  Science,  N.Y.  166 , 1588. 

Ames,  W.F.  (]  972).  Nonlinear  Partial  Differential  Eqs. 

in  Engineering,  VoT7  II,  Academic  Press,  N.Y. 

Birkhoffi  GT "" (]  950) . 1 1 yd  rod  vn  ami  c s , ] st  ed.,  Princeton 

University  Press,  Princeton,  N.J. 

Bluman,  G.W.  § Cole,  J.D.  (]  969).  J.Math.  Mech.  18_,  No.  11,1025. 
Cohen,  A.  (] 931) . An  Introduction  to  the  Lie  Theory  of  One- 
Parameter  Group^  Stechert,  N.Y. 

Dickson,  (]  924)  . Ann.  Math.  (2)  2_5,  287. 

Keller,  E.S,  § Segel,  L.A.  (1971).  J.  Theor.  Biol.  3_0.  225, 

Lie,  S.  (1881).  Math.  (Kristiana)  6_,  328  . 

Morgan,  A.J.A.  (1952).  Quart.  J.  Math.,  Oxford  Ser.  2_,  250. 
Ovsjannikov,  L.V.  (]969).  Soviet  Math.  Dokl . 10,  538. 

Rosen,  G.  § Ullrich,  G.W.  (1973).  SIAM  J.  Appl7~~Math.  24,  #3,  286 
Ullrich,  G.W.  (1974).  Ph.D.  Dissertation,  Drexel  U. , PKila.,  Pa. 
Woodard,  H.  , Jr.  (1971).  Ph.D.  Dissertation,  University  of  Iowa 


140 


ON  UNIQUENESS  OF  PIECEWISE  POLYNOMIAL  APPROXIMATION 


C.  K.  CHUI,  P.  W.  SMITH,  and  J.  D.  WARD 
Department  of  Mathematics 
Texas  A&M  University 


ABSTRACT.  This  paper  deals  with  uniform  approximation  of 
continuous  functions  by  piecewise  polynomials*  A uniqueness  theorem 
for  placing  the  knots  which  balance  the  error  is  established. 
Counter-examples  are  also  given  to  show  nonunicity  for  many 

functions.  Numerical  examples  are  given  to  demonstrate  the 
efficiency  of  piecewise  polynomial  approximation  for  certain 
functions. 


1.  INTRODUCTION.  Although  the  ideas  involved  in  piecewise 
polynomial  approximation  are  quite  elementary,  it  has  been  only 
recently  that  much  effort  has  been  expended  in  studying  these 
approximates.  In  particular,  Burchard  [1],  Rice  [5],  and 
Handscomb  [2]  among  others  have  studied  various  facets  of  the  field. 


If  we  denote  by  P^  the  set  of  functions  which  have  k 
polynomial  pieces  of  order  n (degree  < n)  , then  one  of  our  results 

is:  Let  f £ C[a,b]  satisfy  f^n  ^ is  strictly  monotone.  Then 


f has  a unique  best  uniform  approximant  from  P^[a,b].  A weaker 

version  of  this  result  is  stated  in  [3],  We  will  give  an  independent 
and  complete  proof  of  our  result.  We  also  remark  that  Handscomb 

{2]  has  the  following  result  which  is  closely  related  to  this  above 
theorem. 


/ o \ 

Theorem  (Handscomb).  If  exists  and  is  strictly  convex  in 

[a,b],  then  the  best  uniform  approximation  to  f on  [a,b]  by  a 
spline  of  degree  (n  - 1) (n  > 3)  with  k free  knots  is  unique, 
all  of  its  knots  are  genuine  and  distinct,  and  all  its  coefficients 
are  positive. 

Notice  that Handscomb 1 s condition 

;(n-l) 


(f(n  ^ strictly  convex) 


is 


essentially  the  same  as  f 


strictly  monotone.  Furthermore, 


^ ^kn  w^ere  is  the  set  of  splines  of  order  n with  kn 

knots.  It  follows  that  dist  (f,S^  ) < dist  (f  ,p”)  . However, 

kn  k 


we  believe  that  it  is  much  easier  to  compute  the  best  approximant  to 


f from  than  from  . This  computational  advantage  we  feel 

justifies  further  research  into  piecewise  polynomial  approximation. 


2.  UNIQUE  BALANCED  PARTITIONS.  A mesh  of  order  k is  a non* 
decreasing  (k  + 1)- tuple  of  real  numbers  or  alternatively  it  is  a 
collection  of  k open  intervals  = (p^_^,p^),  i “ 1,  . ..,  k. 

If  u = (Uq,  u^)  is  a mesh,  let  k denote  the  order  and  \(u)  « 
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max  (u  - u _)  the  mesh-size  of  u.  A partition  is  a mesh  with  nonempty 
l<i<k  1 

intervals.  If  u is  a mesh  on  the  open  interval  (a,b),  i.e.,  = a 

and  = b,  then  Pn(u)  is  a collection  of  real  functions  on  (a,b) 
whose  restrictions  to  the  intervals  of  u are  polynomials  of  degree  at  most 

n - 1.  will  denote  the  set  of  all  piecewise  polynomials  of  degree  at 

most  n - 1,  whose  knots  form  a kth  order  mesh.  If  f e L^(a,b), 

1 < p < «,  let 


Ep  n^f,U^  = inf  {|i  f " s 


p, (a,b) 


£ Pn(u)} 


where  u is  a mesh  on  (a,b)  and  let 


E (f,k) 

P.n 


(a,b) 


inf  {E  (f ,u) : u = a,  u = b}  . 
u p > n 0 k. 


By  a balanced  mesh  u of 

property  that  E (f,I.) 

P»*i  J 


order  k,  we  mean  a mesh 
= E (f,I  ) for  all  I 

P»n  i i 


u having  the 

and  I.e  u. 
1 


By  an  optimal  mesh  u of  order  k,  we  mean  a mesh  u which  satisfies 

E (f,k)  = E (f,u). 
p>n  p,n 


In  [1],  Burchard  studied  piecewise  polynomial  approximation  on 
optimal  meshes.  His  primary  interest  was  to  determine  bounds  and 
asymptotic  limits  for  the  error  function  E (f,k). 


The  purpose  of  this  paper  is  to  exhibit  a fairly  general  class 


of  functions  having  a unique  best  approximant  from  in  L^Ca^b). 

We  first  derive  an  important  relationship  between  balanced  error  and 
optimal  partitions  which  has  special  computational  implications. 


Proposition  1,  In  L°°(a,b),  the  balanced  error  partition  exists  and  is 
an  optimal  partition. 


Proof.  The  existence  of  the  balanced  error  partition  is  a 
consequence  of  Lemma  3.2  of  [ l]  . Now  suppose  the  balanced  partition 
u * (tg,  . t^)  is  not  optimal.  Let  = (Tq»  . T^)  be  another 

partition  for  which  E (f,u.,)  < E (f,u).  Since  u is  balanced, 

®,n  1 <*>,n 

it  follows  that  < t^.  Similarly,  x2  < x^.^  < 

But  clearly,  this  is  impossible  since  E^  ^(f , (x^-^))  >_  E^  ft(f  ) = 

E (f,u).  This  completes  the  proof. 

«>>  n 


We  now  give  an  example  which  shows  that  in  spaces  other  than  L°°(a,b) 
a balanced  partition  need  not  be  optimal. 
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Example.  Our  approximating  manifold  in  L1  [ 0 , 4 ] is  the  piecewise 
constant  functions  with  one  variable  knot.  Let  f be  defined  as 


1 x e [0,2] 

f(x)  - J 2 x e [2,3] 

| h(x)  x e [3,4] 

where  h(x)  has  the  properties  that  C - 2 is  a best  constant 

approximation  to  h(x)  on  [3,4]  and  |j  h(x)  - 2||^  * 4/3.  Since 

C = 4/3  is  a best  constant  approximant  to  f (x)  on  [0,3],  the  knot 

at  x = 3 gives  rise  to  a balanced  error  partition  u = (0,3,4)  and 

E (f,u)  = 8/3.  On  the  other  hand,  by  choosing  the  knot  at  x = 2, 

JL  p JL 

we  can  select  our  best  approximant  to  be  y = 1 on  [0,2]  and  y = 2 
on  [2,4]  with  total  error  4/3.  Thus,  the  balanced  error  partition  is 
not  optimal.  We  note  that  we  could  have  made  f(*)  continuous  and  still 
obtained  the  same  result  with  only  slight  modifications  in  the  above 
example*  We  are  now  ready  to  state 

Theorem  1*  If  f e C[a,b]  and  f^n  ^ is  strictly  monotone  on  (a,b), 
then  f has  a unique  best  approximant  from  . 

The  proof  of  this  theorem  follows  directly  from  the  following  three 
lemmas  * 

Lemma  1.1.  If  f is  as  in  Theorem  1,  a < t^  < < b and  p*  is  the 

(unique)  best  nth  order  approximating  polynomial  to  f on  [t^t^l 
then 


-*  • £«  • ‘-1-2 


Proof.  Suppose  to  the  contrary,  that 


d.i) 


< iip*-fi[[t  1 1 • 


Then  by  the  Chebyshev  Alternation  Theorem,  there  exist  points 


T1 Vl 


with  at  which 


p*^)  - f(Ti)  - <-nl  [±  ||  p*  - f ||  [t  t .ji 


and  , i = 1,  ...»  n.  Further,  by  (1.1)  we  see  that 

(p*  - O' Up  = 0 , i = 1 n.  Thus,  by  Rolle's  theorem 

applied  n - 2 times  we  see  that  there  are  two  points  and 
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is  a constant  and 


in  (t^.tj)  so  that 


(p*  ~ f ) (n~1} (e.)  - 0 , i - 1,  2 . 

But  this  is  clearly  a contradiction  since  (p*)  ^ 

f^n  ^ is  assumed  to  be  strictly  monotone*  Of  course,  the  argument  is 
symmetric  if  in  (1.1)  we  replace  t1  by  t 2 and  this  completes  the 

proof  of  Lemma  1. 

We  now  obtain  two  corollaries  of  Lemma  1 which  are  needed  later. 


Corollary  1.1.  The  functional  E (f,(t. ,t„))  is  a strictly  increasing 
“=“====  ro,n  1 L 

function  in  t^  and  a strictly  decreasing  function  in  t^. 

Proof.  Recall  that  in  the  proof  of  Lemma  1,  (f  - p*)'  cannot  vanish 
at  either  or  t2>  Thus,  the  assertion  of  Corollary  1 follows. 


Corollary  1.2.  f - p*  attains  its  norm  exactly  n + 1 times  in 

tti»t2]  ■ 

Lemma  1.2.  If  f is  as  in  Theorem  1,  then  E (f,k)  > E (f,k+l)  . 

■-*--1.  .u  II  ■ U 009TX  CO,n 

Proof.  Let  t < ...  < t be  a balanced  (hence  optimal)  error 

v K. 

partition*  Then  by  adding  a new  break  point,  say  t,  between  a and 
t^,  we  note  that 

E»fn(f’(a’T))  < E-,n<£*<a»t2>>  = Eco,n(f’k)  > E»,n<f »(T»tl)> 

by  Corollary  1.  Hence  E (f , (t,  t,+e., ) ) < E (f,(a,t^))  for  small 

00 , n 1 1 n z 

enough  positive  e^.  But  then,  appealing  to  Corollary  1 again, 

°°,n  1 i z oo,  n i 

and  so  there  exists  an  > 0 such  that 

E»,n<fl,(tl+el't2+e2))  • 

Continuing  in  this  manner,  we  produce  a new  partition 
^t0’T,tl+el’  *'■’  tn_i+en-i»tn^  = 77  so  that 


E (f,k+l)  < E (f,ir)  < E (f,k)  . 

00,n  * — eon  CD.  n 


This  completes  the  proof. 


144 


Lemma  1.3.  The'  only  optimal  partition  is  the  balanced  error  partition, 
if  f is  as  in  Theorem  1. 

Proof . Let  u = (tQ,  ...»  t^)  be  an  optimal  balanced  partition  for  f. 

Suppose  that  there  is  another  partition  un  = (T  , T.  ) which  is  also 

optimal  for  f.  Then  by  Corollary  1,  it  follows  that  x,  _<  t , . . . , T 

1 JL  K“1 

t.  ,*  Reapplying  Corollary  1,  one  obtains  that  T,  - >_  t,  t > t . 

This  proves  the  lemma . 


Proof  of  Theorem  1.  Lemma  3 guarantees  the  existence  of  a unique 
optimal  mesh,  namely  the  balanced  error,  partition  u = (tq>  t^)  • 

Since  ou  each  interval  [t\>  t^+^]  i - 0,  k - 1 the  best  approximating 

polynomial  of  order  n is  unique,  the  best  piecewise  polynomial 
approximation  p*  is  unique.  We  require  p*  to  be  right  continuous  at 
each  to  make  p*  well  defined  at  the  knots.  The  proof  is  completed. 

The  following  is  a consequence  of  Theorem  1. 

Corollary  1.3*  If  f satisfies  the  hypotheses  of  Theorem  1 and  k _>  1, 


then  the  (unique)  best  approximation  to  f from  P_  is  continuous  if  and 

Iv 

only  if  n is  even. 

Proof.  By  Lemma  1,  the  error  j|  f - p*  j]  is  attained  at  the  knots. 
Thus,  one  only  needs  to  check  whether  p*  I r , and  P*  I r i 

Lti-l’iJ  lV  i+lJ 

match  up  at  t^  or  are  on  opposite  sides  of  f. 


3.  NUMERICAL  PROCEDURES.  In  order  to  find  the  optimal  knots  of  the 
best  piecewise  polynomial  approximant  to  f,  we  tried  several  variants  of 
schemes  suggested  in  [4].  The  method  suggested  there  is  a fixed  point 

iteration  scheme.  More  specifically  let  = (u^m^  * . u^)  be 


the  partition  at  the  mth  step  in  the  iteration  procedure*  One  then  may 
adjust  the  knots  according  to  many  criteria.  We  compare  three  methods. 


(I) 


u; 

x 


(mfl)  „ (m) 


= u 


+ C[E  (u . , u - ) ■"  E (u, 

c »>n  i+l  ®,n  i“l  i 


1 = 1,  . . . , k - 1, 

(II)  Set:  A«  - (1/i) 

_ (m+1)  (m)  . . , . (m)  . (m) . 

Then  set:  u.  y = u.  + c.i(A.  - A.  ) 

i i i k i 

i = X,  ....  k - 1 

(III)  Set:  B<m)  = (1/i)  l.  n E (u(raJ,ufm)) 

i Lj=n-x-H  <»,n  j-1’  j 
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Set; 


V«  . UW  + Cii(-B*  + B*> 

Z<“>  - ui  + cli(Aim>  - Ai(B>) 

i - 1,  ....  k - 1 

Set:  - Cv<°>  + z<n))/2  ; 1 - 1. 


k - 1, 


In  all  the  above  schemes,  it  is  assumed  that  the  numbers 

i - 1,  k - 1 are  positive.  It  is  clear  that  the  balanced  error 

solution  is  a fixed  point  of  all  the  above  schemes*  In  [4]  it  is 
shown  that  these  schemes  will  converge  to  a fixed  point  provided  the  c^ 
are  chosen  small  enough. 

Specifically,  if  ■ c < (1/M)[u^  - uQ]  where  M is  the 

Lipshit*  constant  of  E (t1f  •),  then  the  methods  I,  IX,  and  III 

n 1 

converge.  However,  the  choice  of  the  c^  is  delicate  since  too  small  a 
will  converge  very  slowly  and  too  large  a will  fail  to  converge. 

We  found  that  a convenient  conservative  estimate  for 
c^,  or  is  | (p*  - f ) 1 (u^)  ] ^ where  p*  is  the  best  polynomial 
approximate  to  f on  (u^u^^). 

A aimple  Remes  exchange  algorithm  is  used  to  compute  an  approximation 
to  the  best  polynomial  approximant  to  f on  each  of  the  subintervals. 

Our  experience  has  been  that  method  III  generally  produces  the  best  results 
This  should  not  be  surprising  since  it  adjusts  the  knots  according  to  the 
global  error  (as  does  II)  and  secondly,  it  is  not  dependent  on  whether  one 
starts  from  the  right  or  left  endpoint  as  is  method  II, 

The  following  table  typically  illustrates  the  behavior  of  the  methods. 

This  table  represents  the  results  in  trying  to  approximate  F(x)  ■ J xj  on 
[-3,2]  by  piecewise  linear  polynomials. 


Method 

Number 
of  Knots 

Iterations 

Error 

I 

1 

37 

2.556 

II 

1 

201 

2.561 

III 

1 

91 

2.556 

I 

2 

106 

1.2929 

II 

2 

201 

1.2935 

III 

2 

117 

1.2929 

I 

3 

201 

0.6367 

II 

3 

247 

0.6376 

III 

3 

83 

0.6377 

I 

4 

420 

0.42S7 

II 

4 

337 

0,4274 

III 

4 

102 

0.4303 
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These  results  are  quite  representative  of  the  relationships  between 
the  methods.  In  general , for  1 knot  (independent  of  degree)  method 
I appears  to  be  much  faster  than  II  or  III.  Method  III  nearly  always 
beats  method  II  and  after  1 or  2 knots  easily  outdistances  the  other 
methods. 

The  authors  would  like  to  express  their  gratitude  to  Robert 
Strader  who  did  much  of  the  programming* 
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Linear  Generalizations 


of  the  Gronwall -Reid-Bellman  Lemma 


Jagdish  Chandra  Paul  Wm.  Davis 

U.  S.  Army  Research  Office 
Research  Triangle  Park,  North  Carolina 


An  investigation  of  uniqueness  or  continuous  dependence  on 
boundary  data  for  the  system  of  nonlinear  equations, 


0)  wxy  = F(x,  y,  w) , 

w prescribed  on  x = x°,  y = y°, 


where  w is  a vector  and  F is  a matrix  which  is  Lipschitz  con- 
tinuous in  w,  reduces  to  a study  of  integral  inequalities  of  the 
general  form 


(2) 


u(x,  y)  £ a(x,  y)  + G(x,  y) 


H($,  t)u(s,  t)  ds  dt. 


Here  a is  a vector  determined  by  the  boundary  conditions  and  G 
and  H are  continuous  matrices  with  nonnegative  entries. 

Inequality  (2)  is  an  analog  for  systems  in  several  independent 
variables  of  the  Gronwall -Reid-Bellman  inequality  used  in  the  theory 
of  ordinary  differential  equations,  e.g.  [P.  Hartman,  Ordinary  Dif- 
ferential Equations,  Wiley,  p.  24],  By  manipulating  the  resolvent 
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kernel  of  the  integral  operator  on  the  right  side  of  (2),  we  have 
obtained  the  upper  bound  required  to  establish  uniqueness,  continuous 
dependence,  etc,  for  the  system  (1)  of  partial  differential  equations: 


Theorem:  If  u(x,  y)  satisfies  (2),  then 


u(x,  y)  < a(x,  y)  + G(x,  y) 


rx  ry 


X y 


V(x,  y;  s,  t)H(s,  t)a(s,  t)dsdt, 


where  V is  defined  by 


V(x,  y;  s,  t)  = I + 


H(p,  q)G(p,  q)V(p , q;  s,  t)  dp  dq. 


s t 


The  restriction  to  merely  two  variables  is  not  essential. 


The  proof  of  this  theorem  is  simpler  and  the  result  more  general 
than  that  of  D.  R.  Snow  [Proc.  Amer.  Math.  Soc.  33^  (1972),  46 -54]  and 
E.  C.  Young  [ibid.  41  (1973),  241-244],  who  used  a differential-initial- 
value  approach.  In  several  special  cases,  the  matrix  V can  be  com- 
puted explicitly,  thereby  recovering  a number  of  earlier  results, 
including  the  classical,  single-variable,  scalar  lemma  of  Gronwall, 

Reid,  and  Bellman, 

The  discrete  analog  of  (2),  which  arises  in  the  numerical  solution 
of  the  system  (1)  of  partial  differential  equations  by  Euler's  method, 
can  be  treated  similarly. 
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A METHOD  FOR  THE  NUMERICAL  SOLUTION  OF  TWO-POINT  BOUNDARY 
VALUE  PROBLEMS  BASED  ON  THE  USE  OF  VOLTERRA  INTEGRAL  OPERATORS 

H,  Fujita 

University  of  Tokyo,  Japan 
L.  B.  Rail 

Mathematics  Research  Center 
University  of  Wisconsin- Madison 

§0.  Abstract.  Suppose  that  a given  two- point  boundary  value  problem  to 
be  solved  is  a perturbation  of  one  for  which  a considerable  amount  of  analytic 
information  is  available,  in  particular,  the  Green's  function  of  the  unperturbed 
problem  is  known.  In  this  case,  one  can  construct  an  analytic  method  for  the 
solution  of  the  perturbed  problem  which  involves  only  the  solution  of  a Volterra 
integral  equation.  Discretization  of  this  method,  which  is  a version  of  the 
classical  "shooting"  procedure,  leads  to  a numerical  technique  foi  the  solution 
of  perturbed  boundary  value  problems.  Under  suitable  assumptions,  convergence 
of  the  numerical  method  will  be  established,  and  estimates  will  be  obtained  for 
the  rate  of  convergence.  Attention  will  be  devoted  to  the  cases  in  which  the 
unperturbed  differential  operator  is  regular  or  mildly  singular.  Linear  problems 
will  be  considered  first,  followed  by  an  extension  of  the  method  to  the  non- 
linear case. 


AMS(MOS)  Subject  Classifications  - 34B05,  34B15,  45B05,  45D05,  45GQ5, 

45G99,  45L05,  45L10,  6 5R0  5 

Key  Words  - Boundary  value  problems,  Fredholm  integral  operators,  Volterra 
integral  operators,  Nonlinear  integral  equations,  Numerical 
solution  of  integral  equations 

§1.  Preliminaries.  This  section  is  devoted  to  some  basic  information  about 
regular  Sturm -Liouvillle  boundary  value  problems.  For  more  details,  the  books 
by  Ince  [3]  or  Yosida  [10]  may  be  consulted. 

1.1.  The  regular  Sturm -Llouville  operator.  For  the  sake  of  definiteness, 
attention  will  be  confined  to  boundary  value  problems  posed  on  the  real  interval 
I = [0,1].  The  symbol  LQ  will  denote  the  formally  self-adjoint  Sturm -Llouville 
operator  defined  by 


(1.1) 


V = - 


d_ 

dx 


P(x) 


+ Q(x)  <p 


for  all  sufficiently  smooth  functions  <p  = <p(x),  where  P,Q  are  smooth,  real 
valued  functions  of  x on  [0,1],  and  a positive  constant  6 exists  such  that 
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(1.2) 


P(x)  > 6 > 0,  xe  I. 


The  boundary  conditions  to  be  considered  in  this  part  of  the  paper  are  the 
homogeneous  conditions 


(1.3) 


or 


(1.4) 


u(0)  = 0,  u(l)  = 0, 


u'(0)-<r0u(0)  = 0, 
u'fD-o-jUd)  = 0, 


for  positive  o-Q,  cr^,  unless  some  other  conditions  are  explicitly  stated. 

In  order  to  construct  the  Green's  function  and  corresponding  integral 
operator,  suppose  that  cp  = <p(x),  4 1 = 41  (x)  are  functions  such  that 


(1.  5)  L0<p  = 0,  <p(x)  satisfies  the  boundary  conditions  at  x = 0, 

(1.6)  L 4i  = 0,  4j(x)  satisfies  the  boundary  conditions  at  x = 1, 


and  their  Wronskian 


(1.7) 


W(<?,4i)  = 


<p(x)  ip(x) 

<p'(x)  ip'(x) 


does  not  vanish  on  the  interval  [0,  1]  . The  existence  of  functions  <p,^ 
satisfying  conditions  (1.  5)  - (1.7)  is  equivalent  to  the  existence  of  a 
bounded  inverse  of  the  differential  operator  LQ  subject  to  the  given  boundary 
conditions  in  some  space  such  as  C(I)  or  L (I),  I = [ 0,  1 ] . Further,  as 
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it  follows  that 


t w(”-+>  = • w w<^>> 


P(x)W(<p,i|j)  = constant 


for  xe  [0,1] , and  this  constant  value  may  be  taken  to  be 


(1.8) 


mn  = p(xn)w(‘i0>llJ) 
u u xQ 


for  some  € [ 0 , 1]  . 

The  functions  <p, 4/  may  be  used  to  construct  the  Green's  function 
G(x,y)  for  the  differential  operator  and  associated  boundary  conditions. 
One  has 

r 1 

— <p(x)4i( y),  x < y, 

(1.9)  G(x,y)  - J 0 

-J^4j(x)<p(  y),  x>y, 

v.  0 


where  mQ  = 

operator  r|— 
mQ 

that  m0  = 1, 


P(y)W(<p,4j)y  is  independent  of  y.  By  considering  the 

instead  of  Lq,  one  can  assume  without  loss  of  generality 
and  the  Green's  function  is  of  the  form: 


<p(x)4j(y),  x < y , 

(1.9)'  G(x,y)  = < 

v|j(xMy),  x > y . 

The  function  G(x,y)  is  evidently  continuous  in  the  square  I X I, 
However,  it  is  not  regular  at  x = y.  In  I X I - A,  where  A = {(x,x)  |xe  1} 
is  the  diagonal  of  I x I;  the  smoothness  of  G is  determined  by  the  differ- 
entiability of  (p  and  ijj.  In  particular,  if  PeCa+^(I)  and  Qe  CQ(I)  for 
some  a > 0,  then  Ca"*"^(I).  The  following  lemma  will  be  useful  later. 
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Lemma  1.1.  Let  Lq  be  a regular  Sturm -Liouville  operator  associated 


with  certain  boundary  conditions . Assume  that  L^  exists,  where  Lq"1  is 
given  by  the  Green  operator  G,  which  is  the  integral  operator  defined  by 

1 

«Bf)(x)  = 5 G(x,y)f(y)dy, 

0 

the  kernel  G(x,y)  being  the  Green's  function.  Moreover,  assume  that 
PeC“+l(I)  and  QeC“(I)  for  some  a^O,  Then 


G = L’1  : CP(I) 


CP+2(I) 


for  any  (3  such  that  0 < P ^ 

1.2.  Decomposition  of  the  operator  G . It  will  be  shown  that  the 
Green  operator  G may  be  expressed  as  the  sum  of  a certain  Volterra  operator 
W,  associated  with  LQ>  and  an  operator  of  rank  one.  To  construct  the 
Volterra  operator,  consider  the  following  initial  value  problem: 


(1.10) 


L U(0)  = u ' (0 ) = 0 . 


The  solution  of  this  problem  may  be  expressed  in  the  form 


(1.11) 


x 


u(x)  = / V(x,y)f(y)dy  =:  (Vf)(x)  . 
0 


An  explicit  representation  for  V(x,y)  can  be  given  if  one  knows  two 
solutions  of  LqU  =i  0 which  are  independent  in  the  sense  that  their 

Wronskian  does  not  vanish.  The  standard  way  to  do  this  by  the  variation  of 
parameter  formula  [10]  is  to  assume  that  V(x,y)  has  the  form 


V(x,y)  = 


0 , x<y , 

»<p(x)  + P4dy)  = h (x),  y < x , 


and  then  determine  a - o?(y),  p - P(y)  so  that 


f hy(y)=0, 

V-p(y)hy(y) 
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that  is,  so  that  LqIi  = 6(x-y)  in  terms  of  the  Dirac  6-function  [ 2] . This 
leads  to  the  equations 


Cup  + (3qj  = 0 , 


cup'  + fkjj1 


i 

P ’ 


for  a,  (3,  and  the  corresponding  solutions 


«(y) 


o ij; 


" p + ' 


jiM. 


MxL 


W(<p,4j)  P(y)W  (<P,H 


m. 


(3(y)  = - 


<eM 


m. 


This  gives 


(1.12) 


V(x,y)  = / 


^mo 


» x<y  , 


[ <^(x)4i(y)  - <p(y)4j(x)]  , y < x , 


where 

1 T 


m0 
m„  = 


0 


mQ  is  the  constant  defined  by  (1.8).  As  before 
could  be  considered  in  place  of 


, the  operator 
and  one  can  thus  assume  that 


without  loss  of  generality. 

Remark  1.1.  It  is  possible  to  make  an  alternative  derivation  of  (1.12). 
Suppose  that  feC[0,l]  and  ceR*  is  a real  number.  It  follows  that 
u = Wf  + c<p  is  a function  satisfying  L^u  = 0 and  the  boundary  condition 
at  the  left  endpoint  x = 0.  The  method  of  "shooting"  [4]  consists  of  the 
determination  of  the  value  of  c for  which  Vf  + c<p  is  equal  to  (Ef.  It  is 
easy  to  see  that  the  value  c = c(f)  determined  in  this  way  is  a linear 

R1.  Thus,  by  the  Riesz 
exists  such  that 


functional  of  f which  is  continuous  from  L,  into 


representation  theorem  f 2,  p.  20] , an  element 
c(f)  = (f,4^).  The  relationship 


^1£  L2 
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(1.13) 


Vf  + (f,q;  )?»  = ffif 


for  all  fe  is  equivalent  to 

(1.14)  V(x,y)  + <p(x)ifij(y)  = G(x,y) 

for  all  x,y  in  [0,1].  If  x<y,  this  implies  that 

<p(x) '4/j(y)  = G(x,  y)  = <p(x)4dy), 


and  thus 


^(y)  = »v(y)  • 

Using  the  symmetry  property  of  G(x,y),  one  gets 

V(x,y)  + <p(x)4j( y)  = G(x,y)  = i|i(x)<?(y),  x > y, 

hence 


V(x,y)  = i|j(xMy)  - «p(x)4;(y)  . 


Remark  1.2.  It  follows  from  the  preceding  observation  that 


(1.15) 


ffif  = Wf  + (f,i| i)<p  , 


where  <p, 4»  are  the  functions  appearing  in  the  representation  (1.9)'  of  G(x,y). 

Remark  1.3.  Equation  (1.15)  shows  that  the  Green  operator  <E  , which 
is  an  integral  operator  of  Fredholm  type,  is  obtained  from  the  Volterra  operator 
\V  by  a perturbation  of  rank  one.  In  this  sense,  \V  is  close  to  (E  . However, 
\V  and  (G  differ  in  one  essential  respect:  (E  - X\V)~*  exists  as  a continuous 
operator  in  C(I)  or  L^fl)  for  all  X (even  complex  X),  whereas  (E-X(E)_1 
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does  not  exist  if  X is  an  eigenvalue  of  the  operator  (G  . (I  denotes  the 
identity  operator  in  the  space  considered.)  This  fact  will  be  exploited  later. 


1 -3.  Vol terra  Integral  operators.  Some  well-known  properties  of 
Volterra  integral  operators  will  be  summarized  here  for  later  reference.  It 
will  be  supposed  that  the  operator  V .is  represented  by  a kernel  V = V(x,  y) 
which  is  continuous  on  the  triangle  {(x,  y)|o  <y<x<l}.  One  sets 
V(x,  y)  = 0 for  y > x.  Furthermore,  let 

(1.16)  (x  = max  |v(x,  y)|  . 

0<y<x<l 

The  nth  iterate  V11  of  the  operator  V will  have  the  kernel 
V(n)(x,y)=  / V(n_1)(x,t)V(t,y)dt,  y<x, 

y 

with  V^(x,  y)  = V(x,  y).  The  following  result  is  easily  established  by 
mathematical  induction. 

Lemma  1.2.  For  M defined  by  (1.16), 

(1.17)  lv(n)(x,y)l<^^^,  y<x, 

n = 1,  2 where  V n (x,  y)  is  the  nth  iterated  kernel  of  V(x,  y).  In 

particular, 


(1.18)  11^11  <^, 

n ! 

where  II  ||  denotes  the  operator  norm  on  C(I)  or  L^(I). 
The  iterated  kernels  are  defined  to  be  zero  for  y > x. 
Corollary  1.2.1.  The  Neumann  series 

(1.19)  ( I-V)"1  =I  + V + V2  + -’*  + Vn  + -«- 
converges  in  the  operator  norm,  with 

(1.20)  ||(l-vr1  ||  < e11  . 
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Remark.  1.4.  If 


(1-21) 

then 

(1.  22) 


I V(x,  y)  | < ^(x-y),  y < x , 


lv(n)(x,y)  | < ^ , Y<x, 


n = 1,2,...  . 

In  this  case,  the  estimate  (1.  20)  can  be  replaced  by 
(1.  22)'  Jl(H-V)  ^ ||  < coslW|ji  . 


For  Volterra  operators  arising  from  a regular  Sturm-Liouville  operator  L^, 
(1.21)  and  consequently  (1.22)  - (1.22)'  hold. 


1.4.  Degenerate  perturbation  of  Volterra  integral  operators.  Consider 
a Volterra  integral  operator  V with  kernel  V(x,  y)  as  defined  previously, 
and  a degenerate  operator  $ of  finite  rank  m defined  by 


m 

Su  = Z * 

j = l J J 

where  , ijj • are  given  linearly  independent  functions  in  C[0,1].  The 
operator  ■*  ^ 


K = V + S, 


is  called  a degenerate  perturbation  of  the  Volterra  integral  operator  V.  The 
following  notation  will  be  useful:  The  operator 

JJ  = (n  - v)"1 


always  exists  by  the  previous  results, 
continuous,  one  may  define  the  matrix 


For  <p  ib,  i,j 

A = (a, .)  by 
ij 


a. , 
U 


(4^  • 


1,2,.. 


m 
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The  letter  d will  be  used  to  denote  the  determinant  d = det(I  -A), 
where  I is  the  identity  matrix  of  order  m.  The  following  theorem  is 
a speciarcase  of  a famous  result  for  linear  integral  equations. 


Theorem  1.1  (Fredholm  Alternative).  Consder  the  equation 
(1.  2 3)  u - Ku  = f,  f € X , 

where  X is  either  C( I)  or  L2(I).  Either  (i)  d * 0,  and  (1.23)  is 
uniquely  solvable  for  any  f e X,  or  (ii)  d = 0,  in  which  case  the 
homogeneous  equation 


(1.  24)  uQ  - KuQ  = 0 

has  nontrivial  solutions.  (That  is,  1 is  an  element  of  the  point  spectrum 
<r  (K),  in  other  words,  an  isolated  eigenvalue  of  K.) 

Proof.  Since  It  - V and  JJ  are  both  continuous,  equation  (1.23) 
is  equivalent  to 


(1.25) 


u - JJSu  = JJf  . 


By  definition, 


m 

]J$u  = Yj  (u,  dj  )JJy>,  • 
J=1  J J 


Setting  y,  = (u,«|/),  j = 1,  2,...,m, 


(1.25)  becomes 


(1.26) 


and  since 


one  has 


(1.27) 


If  , 


m 

i 

i=i 


yM.tUv.)  - (Iff,  4-J 

J ,1  J i 


m 

E atix.  - (Jf,^)  , 

j=l  J J 
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i = 1,  2,  . . . , m.  If  d * 0,  then  (1,  27)  determines  the  values  of 

■y p -y2,  . . . , Ym  uniquely,  and  the  unique  solution  u of  (1.  2 3)  .is  determined 

correspondingly  from  (1.  26),  that  is 

nr1 

(1.28)  u = l y.JS<p.  + ]Jf  • 

J=1  J J 

If  d = 0,  then  the  homogeneous  equation 


(1.  29) 


m 


j=l 


a. . Y. 

U ) 


0 


will  have  nontrivial  solutions  belonging  to  some  r-parameter  family. 
Corresponding  to  this,  (1.  24)  will  have  the  r-parameter  family  of  nontrivial 
solutions 


m 


(1.30) 


u. 


= Z 

i=i  1 ‘ 


§ 2 ■ Regular  Sturm- Liouville  problems  perturbed  by  lower  order  terms . 

The  operator  will  be  considered  to  be  a regular  Sturm-Liouville  operator 

with  associated  boundary  conditions  of  the  type  considered  in  §1.  Here, 
attention  will  be  devoted  to  equations  of  the  form 


(2.  ]) 


-LqU  + p(x)  “ + q(x)u  = fj  , 


subject  to  the  boundary  conditions 


(2.  2), 


u(0)  = u( 1 ) = 0 , 


or 


(2. 2), 


u»(0)  - <r0u(0)  = 0 , 
u'(  l)  - 0-  J u(  l)  = 0 , 


where  <r  , <r.  > 0 are  given  constants.  The  differential  operator  M is 
defined  by 


(2.  3) 


. , du  . 

Mu  = p ~ + qu 
dx 
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Equation  (2.  l)  will  be  dealt  with  from  the  standpoint  that  the  operator  LQ 
is  perturbed  by  the  operator  M of  lower  order. 

The  functions  p,q  entering  into  (2.  3)  are  assumed  to  be  smooth. 
In  particular,  if  P e Q e Ca,  a > 0,  and  one  assumes  likewise 

that  p e Ca+*,  q e Ca,  then  it  follows  from  f^  e that  u e C^+2 
for  any  |3  such  that  0 < |3  <a,  where  u is  any  solution  of  (2.1). 

Equation  (2.1)  and  the  corresponding  boundary  conditions  may  be 
transformed  into  equivalent  integral  equations  by  means  of  the  Green's 
operator  G = and  the  corresponding  Vol terra  operator  V = G - ( , 

as  defined  in  §1.  First  of  all, 

(2.4)  u - (EMu  = f,  where  f = -Gf^  . 

This  is  equivalent  to  (2.1)  and  (2.  2)Q  or  (2.  2)^.  Next, 

(2.5)  u - VMu  - (Mu,i|j)(p  = f , 

which  is  equivalent  to  (2.  4)  and  hence  to  the  original  boundary  value 
problem.  By  using  the  definitions  of  G and  V,  it  can  be  checked 
directly  that  any  solution  u of  (2.4)  or  (2,  5)  will  satisfy  the  boundary 
conditions,  and  also  the  differential  equation  (2.1). 

Before  a general  treatment  of  equation  (2.5)  is  given,  two  special 
cases  will  be  considered. 


Case  I.  p = 0.  Here,  one  sets 


(2.  6) 


C V(x,  y)  = V(x,  y)q(y)  , 
t|j(y)  = q(y)4j(y)  > 


and  obtains  the  corresponding  equation 

x 

(2.7)  u(x)  - f V(x,  y)u(y)dy  - (u,  l|j)<p(x)  - f(x)  . 

0 


In  this  case,  V(x,  x)  = 0.  One  has: 

Proposition  2.1.  Equations  (2.  5)  and  (2.7)  are  equivalent. 

Casell.  The  boundary  conditions  (2.2)^  hold.  By  integration  by  parts, 
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x y-x  x 

/ V(x,  y)p(y)  ~ dy  = V(x,  y)p(y)u(y)  -/  — {V(x,  y)p(y)  }u(y)dy 


'y=o  o 


ay 


= “ / fr  (V(x,  y)p(y)}u(y)dy  , 
0 Y 


since  the  first  term  vanishes  at  y = x because  V(x, x)  = 0,  and  at 
y = 0,  one  has  u(0)  - 0.  Also, 


/ p(y)  4^(y)dy  = p(y)u(y)4»(y)  - / u(y){p(y)ijj(y)}'dy 


dy 


1 

= - f u(y)(p(y)4i(y) }'dy  , 

0 

since  u(0)  = u(l)  = 0.  Thus,  for 

V(x,y)  = - ~~  {V(x,  y)p(y) } + V(xf  y)q(y)  , 

My)  = ~ {p(y)iKy)}'  + q(y)^(y)  , 


X ^ 

u(x)  - f V(x,  y)u(y)dy  - (u,  J)<p(x)  = f(x)  , 

0 

an  equation  of  the  same  form  as  (2.7).  From  the  above  derivation,  one  has: 

Proposition  2.2.  Equations  (2.5)  and  (2.9)  are  equivalent. 

Case  III.  Reduction  of  the  general  case  to  a system  of  simultaneous 
equations.  This  is  similar  to  the  standard  procedure  for  the  conversion  of  a 
differential  equation  of  arbitrary  order  into  a system  of  first  order  equations. 
One  sets 


(2.8) 

one  obtains 
(2.9) 


v 


du 
dx  ’ 


w 


Equation  (2.  5)  may  be  written  in  terms  of  u and  v as 
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X 

u(x)  - f V(x,  y) (p(y)v(y)  + q(y)u(y)}dy  - (pv  + qu,  ^)^(x)  = f(x)  , 

o 

since  Mu  = pv  + qu.  Differentiating  this  equation  with  respect  to  x 
and  making  use  of  the  fact  that  V(x,  x)  = 0 gives 

x 

v(x)  - f v (x,  y){p(y)v(y)  + q(y)u(y)}dy  - (pv  + qu,^)?'(x)  = f'(x)  , 

0 a 

' where  V^x,  y)  - ~ V(x,  y)  . 

The  two  equations  above  can  be  combined  into  a single  vector  equation  for 
w.  Set 


V(x,  y) 


4*(y)  = 


q(y)^(y)\ 
p(y)4j(y)  i 


V(x,  y)q(y) 

V(x,  y)p(y)  \ 

V2(x,  y)q(y) 

V^x,  y)p(y)  { 

9 

Mx)  \ 

, 5(x)  = , f(x)  = 

\<P'(x)/ 

I f(x) 
\f'(x) 

Then 


X 


w(x)  - f V(x,  y)w(y)dy  - <w,  J)^(x)  = f(x)  , 
0 


(2.10) 

where  (w,i|;)  denotes  the  inner  product 


1 

<w,  J > = f (u(y)  v(y)) 
0 


( q(y)4^(y)  \ 

dy 

y p(y)4j(y)  / 


The  following  result  is  evident. 

Proposition  2.3.  Equations  (2.5)  and  (2.10)  are  equivalent. 

Case  IV.  Another  treatment  of  the  general  case.  If  one  sets 
w = Mu,  then  equation  (2.  5)  becomes 


(2.11) 


u - Vw  - (w,  if<)^  = f . 
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Operating  on  (2.11)  with  M gives 


w - MVw  - (w,  - Mf  . 


Thus,  for 

V(x,y)  = p(x)  ~ v(x,  y)  + q(x)V(x,  y)  =:  MxV(x,y)  , 

{ <p(k)  = M<p(x)  = p(x)  ^ + q(x)cp(x)  , 

^ f(x)  = Mf(x)  = p(x)  ^ + q(x)f(x)  , 

one  has  the  equation 

(2.12)  w - Vw  - (w,  ip)<p  = f , 

which  is  of  the  type  (2.5).  In  order  to  recover  u from  a solution  w of 

(2.12) ,  it  is  not  necessary  to  solve  the  first  order  differential  equation 
w = Mu  for  u.  Instead,  one  has  directly  from  equation  (2,11)  that 

(2.13)  u = Vw  + (w,  ijj)<p  + f , 
and  thus: 


Proposition  2.4.  Equation  (2.  5)  is  equivalent  to  the  system  of 
equations  (2.12)  and  (2.13). 

In  general,  then,  one  only  has  to  solve  an  integral  equation  of  the  form 

(2.14)  u - Vu  - (u,ij;)<p  = f , 

where  V is  a regular  Voiterra  integral  operator  and  <p^  t|j,  and  f are 
given  smooth  functions.  In  order  to  solve  (2.14),  put  JJ  = (I  - V)  , which 
always  exists.  It  follows  that  (2.14)  is  equivalent  to 


(2.15) 

or  to  the  system 

(2.16) 


1%/  /W  M 


u - (u,  <p)y<p  = jf  , 


u = \If<p  + Jf  , 

[i  -(K,J)b  = (Jf,5) , 


for  v = (u,  4»).  Thus,  after  JJ  <p  and  jf  have  been  formed,  one  finds 
the  solution  u of  (2.15)  directly  from  (2.16),  provided  that 


(2.17) 


nj  rv  ru 


1 - * o 
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§3.  Numerical  solution.  The  solution  of  an  integral  equation  of  the  form 


(3.1)  u - Vu  - (u,  i|j)(p  = f , 

where  V(x,  y),  <p(x),  i|j(x),  and  . f(x)  are  given  smooth  functions,  may 
be  broken  down  into  the  following  steps.  First,  one  solves  the  linear 
Volterra  integral  equations 

(3.  2)  $ - V<£>  = <p,  F - VF  = f 

for  the  functions 

(3.3)  $(x)  = JV(x),  F(x)  = Jf(x)  . 

The  inner  products 


1 1 

(3.4)  ($,4*)  = / $(x)i|;(x)dx,  (F,  4j)  = / F(x)i)j(x)dx  , 

0 0 

are  then  calculated,  which  leads  to  the  scalar  equation 

(3.5)  [1  - (*,^)Jy  = (F,4.) 
for  the  constant 

(3.  6)  y = (u,  ijj)  . 

Assuming  that  1 - ($,41)  * so  that  (3.5)  is  uniquely  solvable,  one 
obtains  the  solution  u(x)  of  the  original  equation  (3.1)  in  the  form 

(3.7)  u(x)  = F(x)  + «(*)  - 

In  general,  one  would  not  expect  to  obtain  explicit  representations 
for  the  functions  F(x),  4>(x),  and  the  integrals  appearing  in  (3.4).  This 
leads  to  the  consideration  of  a number  of  procedures  for  obtaining  approximate 
solutions.  For  example,  since  the  Neumann  series  for  JJ  = (U  — V)”-1  is 
convergent,  it  is  natural  to  try  to  obtain  the  values  of  (3.  3)  and  (3.  4)  in 
terms  of  infinite  series.  Another  approach,  the  one  which  will  be  considered 
in  this  paper,  is  to  use  numerical  integration  (Nystrom's  method)  for  the 
solution  of  the  integral  equations  (3.  2)  and  the  evaluation  of  the  integrals 
(3.  4).  (It  is  not  necessary  to  use  the  same  rule  of  numerical  integration  for 
each  purpose.)  Once  again,  the  smoothness  of  the  functions  involved  leads 
to  the  possibility  of  obtaining  fairly  accurate  results  by  numerical  integration. 


165 


This  is  in  contrast  to  what  may  happen  if  the  Nystrom  method  is  applied  in 
a straightforward  fashion  to  the  Fredholm  integral  equation 

(3.8)  u - (Eu  = f , 

as  the  singularity  in  — G(x,  t)  at  x = t may  affect  the  accuracy  of  a 
given  procedure  for  numerical  integration  [1,7]. 

Convergence  of  this  method  of  numerical  solution  will  be  proved 
under  fairly  mild  restrictions  concerning  the  quadrature  formulas'  to  be 
used.  It  will  be  assumed  that  the  interval  [0,1]  is  partitioned  into  N 
subintervals  by  means  of  the  points 

0 = ■■■  < SN-1  4 = 1 • 

It  is  not  essential  that  these  subintervals  be  of  equal  length;  however, 
the  lengths  of  these  subintervals  should  go  to  zero  uniformly  as  N -* 

As  usual,  one  sets 

(3-9)  </>.  = ‘Pdi)>  ^ = 4J(il)  > 

i = 0,1,  . . . , N,  and  one  seeks  approximations  u^  to  u(£.)  byreplacing 
(3.1)  by  a finite  linear  system  of  equations.  This '’discretization  is 
accomplished  by  the  introduction  of  quadrature  formulas 

lnX  . N . v «N  N 

(3.10)  <u  , 4*>  = L C u ^ 

j = 0 J J J 

to  approximate 

1 

(u,  4<)  = / u(x)i(j(x)dx  , 

0 

and 


(3.11) 


as  approximations  to  the  integrals 
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N N 

i - 0, 1,  . . . , N.  The  values  of  C.  , C, , depend  on  the  specific  numerical 
integration  rules  used.  This  leads  to  the  system 


(3.12) 


N 


z 

j = 0 


N N 
K.  u. 
U J 


/ N . 

'U  ’*Vi 


N 


.1  - 0,1,...,N,  for  u"  - (u  , u . . . , u^).  The  prescription  for  solving 
this  finite  system  is  essentially  the  same  as  for  the  integral  equation  (3.1). 
As  in  [ 7 J , one  solves  the  lower  triangular  systems 


i 


(3.13) 


N V „NrN 


i 


N v ,,N  _ N 

1 jfo  1J  > *'  1 * * * i?o  11  1 ~Vl' 


F - 


U 


N W IV  V'  i\  IN 

K F - f.,  $ - >,  K ; 4 

111  1 1 1 imJ  11  1 


■NT  f t,N  /t,N  t,N,  N , N N 

i = 0,l,...,N  for  F = (F0,F1 

forms  the  inner  products 


,$N),  and 


(3.14) 


y 


<fn,4j)n=  i c"r;>4  , 


N 

S' 

L j 

j =0 

N 


V N N 
- > C F \ 

J 1 J 


<*  »+>N  = Z cj  *j>j 
V j=0  1 J J 


Then,  if 
(3.15) 


1 - <$N,  4^>n  t 0 , 


the  solution  of  (3.12)  is  given  by 

(3-16) 


N N N N 
u - F + y <b  , 


where 


(3.17) 


N 

V 


<FN,^)N 
1 - (<tN,p 


N 


Notice  that  it  is  not  necessary  to  evaluate  the  inner  product  \ - (u  , iJj)-. 

directly. 
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Ordinarily,  one  has  V(x,  x)  = 0,  which  implies  that  K. . - 0, 

1 = 0, 1,  . . . , N.  In  this  case,  the  amount  of  arithmetic  required  to  obtain 
the  solution  (3.16)  of  (3.12)  has  been  shown  [7]  not  to  exceed  one  division, 
i 4N  + 5 multiplications,  and  1 4N  + 4 additions.  The  following 
arguments  may  be  modified  easily  to  handle  the  case  + 0,  at  the 
possible  cost  of  J(N^  t 5N  + 5)  additional  divisions.  11This  modification 
will  be  indicated  at  the  end  of  this  section. 

The  following  assumption  will  bo  made  concerning  the  quadrature 
formulas  used  in  (3.11)  and  (3.14):  A constant  a,  independent  of  N, 
exists  such  that 


(3.18) 


a | N | _a_ 

N ’ ij  ~ N ’ 


0 < j < i < N.  Since  it  is  assumed  that 

Iv(x,  y)  I < p , 


this  in  turn  implies 
(3.19) 


U(JL 

N 


0 £.  j < 1 < N.  These  inequalities  may  be  used  to  obtain  a uniform  bound 
for  the  inverse  matrices 

(3'M)  JN  (IN  - V"1  > 

where  I denotes  the  identity  matrix  of  order  N + 1,  and  K is  the 
strictly  lower  triangular  matrix  with  elements 


(3.  21) 


r 


Vi,  = < 


K 


N 

ij 


0 < i < j < N , 
0 < j < i < N . 


For  this  purpose,  it  will  be  convenient  to  introduce  the  norm 
( 3.  22)  ||  z ||  - max  | z.  | 

(i)  1 

for  vectors  z - (z^,  z^,  . . . , z ) in  the  space  ^ of  (N+l)-dimensional 
real  vectors.  The  corresponding  bound  j| A JJ  for  real  matrices  A - (A,.)  of 
order  N + 1 is  ^ 
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N 


(3.  23) 


I A ||  = max  Yj  | A. . 

(i)  j = o l] 


It  may  be  noted  that  the  inner  product  (,  defined  by  (3.10)  may 

be  used  to  obtain  the  norm  ||  z ||  = ( z,  z)^.  on  R , provided  C,  >0, 
j = 0,1,  . . . , N.  This  norm  and  the  corresponding  matrix  norm  will  beJ 
equivalent  to  (3.22)  and  (3.23),  respectively.  For  computational  purposes, 
(3.22)  and  (3.23)  have  the  advantage  of  simplicity. 

Lemma  3.1.  Let  B = (B,,)  be  a strictly  lower  triangular  matrix  of 
order  N + 1 with  elements  ^ 


(3.24) 


i] 


0 , 0 < i < j < N , 


b..,  0 < j < i < N 


Furthermore,  assume  that  a constant  \ > 0 exists  such  that 


(3.25) 


b, . < \ , 

u 


N+l 


0 < j < i < N.  Then,  for  any  vector  z e R and  any  positive  integer  p, 

P 


(3.  26) 
i - 0, 1,  . . . , N.  Thus, 

(3.  27) 


|(BPz),  I < ~ i(i-l)  • ■ • (i-p+1) 

.1  P * 


!l  BP  II  < ^ 
~ P ! 


and 

(3.28) 


(IN-B)-‘ll  <e"N 


Proof:  Inequalities  (3.26)  will  be  established  by  mathematical 
induction  on  p.  Note  that  it  follows  from  (3.  26)  that 


(3. 29) 

for  0 < i < p.  For  p = 1, 


(B  z),  = 0 

l 


(Bz)q  = 0 , 
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and 


(Bz),  = 


b ,.z,  , 
iJ  3 


i = 1,  2,  . . . , N.  Thus 


b,  ,z, 
ij  J 


< 


‘}j  \ II  z II  = iK  II  z ||  , 


j = 0 


i = l,  2,  This  verifies  (3.  26)  for  p = 1.  Assume  that  (3.  26)  holds 

for  p = m.  Once  again,  one  has 

(Bm+1z)0  = (B(Bmz))0  = 0 . 

Also,  for  i = 1,  2,  , . . , N, 


(Bm+1z).  = (B(Bmz)).  , 2 b.,(Bmz).  , 

1 1=0  » > 


1-1 


so 


(3.  30) 


Actually, 


m 

^7  j(j-l)  ■ • • (j-m+1)  ||z  ||  . 


i’l  i-1 

2 JO-1)  '**  0-m+l)  = ^ JO-1)  •••  0-m+l)  . 

j = 0 j~m 


Another  application  of  mathematical  induction  will  be  made  to  show  that 


1-1 

^3’3^  Z JO-1)  (j-m+1)  = i(i-l)  •••  (i-m) 

j = m 

for  all  positive  integers  i > m + 1,  For  i = m + 1,  both  sides  of  (3,  31) 
reduce  to  (m+1) !,  and  thus  it  is  valid  initially,  Assuming  that  (3.  31) 
holds  for  some  positive  integer  i = n > m + 1,  one  has 
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n n-1 

(m+1)  Yj  J(j Jl)  * * ' (j-m+l)  = (m+l)n(n-l)  • • • (n-m+1)  + (m+1)  j(j-l)  • • • (j-m+1) 
j = m j = m 


= (m+l)n(n-l)  • • • (n-m+1)  + n(n-l)  ■ • • (n-m) 


= (n+l)n(n-l)  • • • (n-m+1)  , 

which  is  (3.  31)  for  i = n + 1.  This  completes  the  "inside"  induction. 
From  (3.  30)  and  (3.  31), 

. , , m+1 

|(Bm  z).  | < (m+1j'7  i(i-l)  ' ’ * (i-m)  II z || 


which  is  (3.26)  for  p = m + 1.  This  completes  the  "outside"  induction  and 
shows  that  (3.26)  holds  for  all  positive  integers  p.  Inequality  (3.  27) 
follows  immediately  from  (3.26).  Since  B^™1  = 0,  B is  nilpotent,  and 
(I^-B)--'-  always  exists  and  is  given  by  the  finite  Neumann  series 

(In-B)_1  = IN  + B + B2  + + BN  . 


Hence, 


!I(I  -B)"1 1|  < 2 

p=0 


N 


00 


B' 


- 1 «B‘ 

P-0 


< l 


OO  l 

n (XN)* 


\N 


p = 0 


P! 


which  proves  (3.28).  This  completes  the  proof  of  Lemma  3.1. 


Lemma  3.  2.  For  the  matrix  defined  by  (3.  21),  one  has 

(3.32)  lljNll  = |l(IN-KNr1||  <eW 
for  all  positive  integers  N. 

Proof:  This  follows  immediately  from  (3.19)  and  Lemma  3.1,  as 
one  may  take  X = ap/N. 

N 

Thus,  it  is  always  possible  to  solve  equations  (3.13)  for  F and 
$ , furthermore, 

(3.33)  llFN||  < , ll®N|| 
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uniformly  in  N for  bounded,  Riemann-integrable  functions  f(x),  </>(x). 
Actually,  it  will  not  be  a particular  handicap  to  restrict  the  following 
discussion  to  continuous  functions,  and  use  the  symbol  II  II  for  the  norm 
in  ^ and  C(I),  as  its  significance  will  always  be  clear  from  the 
context.  In  fact,  some  error  estimates  for  various  methods  of  numerical 
integration  involve  derivatives  of  the  integrand.  To  use  such  estimates, 
the  class  of  functions  considered  has  to  be  restricted  to  those  which  are 
sufficiently  smooth,  and  bounds  for  derivatives  of  these  functions  would 
enter  into  the  definition  of  the  norm. 


The  following  assumptions  will  be  made  concerning  the  rates  of 
convergence  of  the  quadrature  formulas  used: 


J h(x)dx  - Yj  c-  h(U 


(3.34)  < 


0 


j = 0 


) ) 


<£  (N)  || h ||  , 

P P 


^i  i 

/ h(x)dx~  £ A(£.) 
0 j=0  } 1 


< e (N)  || h 


0,1 


where  e (N),  e (N)  are  called  the  rates  of  convergence  of  the  integration 
formulas,13  with 


(3.  35) 


lim  e (N)  =•  lim  e (N)  = 0 , 


N -•*<» 


N -*  oo 


(it  is  assumed  that  e (N)  is  independent  of  i),  and  ||  ||p , II  II  , are 
suitable  norms,  the  latter  being  defined  on  the  interval  [0,£  ].  ’ ' 

Lemma  3.  3.  Suppose  that  h(x)  and  V(x,  y)  are  smooth  in  the 
sense  that 


(3.  36) 


II  V(x  - ) JJh  ||  , < M-.(JJh),  0 < x < 1 , 

o,i  1 


uniformly  in  i.  Then, 

(3.  37)  |(J  h)  - (]Jh)(£.)  | < e (N)u  (Jh^  . 

IN  1 l 0"  X 

In  particular,  if  {i.,}  is  a sequence  of  positive  integers  such  that 

lim  £ - i , 

N — oo  N 


then 
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then 


lim  (Jh)  = (Jh)(e) 

N-*oo  N 


uniformly. 

Proof:  For  simplicity  of  notation,  put  g = Jh.  By  definition, 

^i 

(Vg)(^.)  = / V(£  , y)g(y)dy  , 

0 


and 


(kn9)i 


|0c“v<vV9(6i) 


Thus,  from  (3.34), 

(3-38)  If K h-Vg||<e  (N)  sup  ||  V(|  , * )g  || 

1M  (J  / , \ 1 CF  _ X 

(1) 

- > 

where  the  norm  on  the  left  is  taken  in  where  Vg  = (Vg(|^),  . . . , Vg(^)). 

One  has 

iNh  = V1  - v>ih  - 


and 

Th  = JN(IN  - KN)Ih  ' 

Consequently, 

(INh)i  - (Ih,(h>  = Vhj  - V)Ih  = W - V9)  ' 

from  which  (3.  37)  and  the  uniform  convergence  of  the  numerical  values 
(J  Jh),  to  (tfh)(£)  follow. 

N 


173 


Remark  3.1.  Since  Jh  .is  smoother  than  h,  condition  (3.36)  is  not 
as  restrictive  as  it  might  appear. 

Remark  3.  2.  Suppose  that  one  chooses  the  simple  trapezoidal  rule 
of  numerical  integration  [1.  pp.  15-17].  Then,  one  may  take 
II  II  , = II  II  , and 
C2,i 

£ (N)  = -J-T,  |UL  (JTh)  < K,  II flh  l|  < K,  llh||  , 

a 12N2  1 1 C2  2 ° 

where  the  constants  k^,  k2  depend  only  on  V(x,  y).  In  this  case, 


(3.  39) 


ll(INh)-IhllRN+1i^7llhllC- 


Lemma  3.4.  Under  the  above  assumptions, 

1 

(3.40)  lim  <JN^,  i|>)  = / ^(x)(Jfa)(x)dx  . 

N -*-oo  0 

Proof:  Since  <p  is  smooth,  one  may  take  - kIIf^II  , where 

k depends  only  on  V,  and  ||  |[  denotes  ||  ||  , defined  on^the 

entire  interval  [0,1].  This  gives  ’ 

l(JN4>).  - (Fv>)(ei)  I < eo.(N)Ke^M'  ||  ||o  , 

i = 0, 1,  . . . , N.  Thus,  using  (3.18), 


(3.41) 


N N 

2 c (j  <p)  ip  - E c (iM(£  HU.) 
i=0  i=  0 


N 


< e (N)KeaH'||]J<p||  J lcN|  I+-  I < 
cr  cr  , ^ _ i i 


1 = 0 


< e^(N)KeQ,|J'  || ]J<p  ||^_  llijj  II  c«(l  + ^)  • 


Now,  by  (3.  34), 
(3.42) 


N 1 

E c,-  UfoOUiMSJ  ' / ^(x)(H<p)(x)dx 

i = 0 0 

<e(N)||^||  . 

p p 
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From  (3.41)  and  (3.42), 


(3.43)  ^(x)(J<p)(x)dx  I < k^£o.(N)(1  + ^-)  + K^e^(N)  , 

from  which  (3.40)  follows  immediately. 

Remark  3,  3.  One  may  use  the  smoothness  of  flip  to  replace  ||  JJ<p  || 
by  '1 93 1' o-_ 2 ( 3*  41),  where  the  constant  depends  only  on  V. 

Remark  3.  4.  If  the  trapezoidal  rule  is  used,  then 


X 

On<p,i|j>m~  / iJj(x)(lM(x)dx 


N 


< t(k  + K ) + 

12N2  “■  P 


12N 


3 K(r  ’ 


where,  since  a = 1, 


and  depends  only  on  V. 

Lemma  3.  5.  As  in  Lemma  3.  4, 

1 

(3.44)  lim  <J  = / i|j(x)(]If)(x)dx  . 

N->m  0 

The  same  type  of  convergence  rate  as  (3.43)  may  also  be  obtained  in 
this  case. 

Theorem  3.1.  Suppose  that  the  original  integral  equation  (3.1)  is 
uniquely  solvable  for  all  f.  Then,  a positive  .integer  exists  such 
that  for  all  N > N^,  the  discrete  system  (3.12)  is  uniquely  solvable  for 
all  f,  furthermore,  the  solutions  u^  of  (3.12)  converge  to  the  solution 
u(x)  of  (3.1)  as  N -►  00. 

Proof:  The  condition  1 - ($,iJj)  # 0 is  necessary  and  sufficient  for 
the  unique  solvability  of  (3.1).  Consequently 

(3.  45)  |l  - (&,  4*)  | = 26  > 0 . 
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. V 

Recalling  that  $ = flip,  $ = J <p,  it  follows  from  Lemma  (3.4)  that  a 

positive  integer  exists  such  that 

(3.46)  |<$N,  i|j>n  - (4>,  40  I < 6 
provided  N > NQ.  Thus,  as 

(3.47)  |l  - <$N,4;>nI  > |l  - (i>,4)  I - |<$N,^>n  - (<£,^)l  >6>0, 

the  finite  system  (3.12)  is  uniquely  solvable  for  all  f as  long  as  N > N^. 
The  convergence  of  the  numerical  solutions  (3.16)  to  the  analytic  solution 
(3.  5)  follows  at  once  from  Lemmas  3.3-  3.  5. 

The  inequalities  used  in  the  proofs  of  Lemmas  3.3-  3.  5 may  be  used 
to  obtain  the  following  result. 

Corollary  3.1.1.  If  N > N , then 

(3.48)  |uN  - u(|  ) | < 0(e  (N))  + 0(e  (N))  . 

lip  0" 

Thus,  the  convergence  rate  of  the  numerical  solution  is  determined 
by  the  slower  of  the  convergence  rates  of  the  numerical  integration  methods 
used,  and  is  of  the  same  order  of  magnitude. 

The  case  V(x,  x)  4 0 will  now  be  considered  briefly.  For  N > N. , 
where  is  some  positive  integer,  condition  (3.19)  guarantees  that 

ft  TvT 

(3.  49)  1 - K 4 0 , 

.U 


and  the  corresponding  value  used  in  subsequent  inequalities.  This  modifica- 
tion does  not  alter  the  rates  of  convergence  derived  previously. 
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§4.  Mildly  singular  Sturm- Liouville  problems.  The  technique  presented 
above  for  regular  boundary  value  problems  of  Sturm-Liouville  type  can  be 
extended  immediately  in  a formal  way  to  problems  which  involve  various 
types  of  singularities.  However,  the  analysis  becomes  more  delicate, 
particularly  for  the  corresponding  numerical  procedure.  As  an  example, 
consider  the  problem 


r 


ta[(1_x2)  S1  + CI(x)u  = Vx)- 


(4.1)  \ u(0)  = 0, 


u(l)  | < + oo  . 


V 

Assuming  that  q(x)  is  "small",  take 


(4.2) 


, d r , 2.  du  . 

Ln  = - — [ (1  - X ) — ] 
0 dx  dx 


The  functions  </>(x)  and  i|i(x)  defined  by  (1.5)  and  (1.6),  respectively,  are 
(4.  3)  <p(x)  = log  , ip(x)  = 1 , 


up  to  a constant  multiplier.  Their  Wronskian  is 


(4.4) 


W(x) 


log 


1 + x 

1 - x 


1 


1 1 
1 + X + 1 - X 


0 


so  that 

(4.  5)  m = (1  - x2)  W(x)  = -2  . 

The  corresponding  Green's  function  (1.9)  is 


(4.6) 


r 


G(x,y)  ={ 


1 . 1 + x 

~ log  - , 

2 1 - X 


1 , i ± y 

2 109  1 - y > 


X < y , 


x > y . 


Assuming  that  the  transformed  function 
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(4.7) 


1 

f(x)  = / G(x,y)f  (y)dy 
0 

exists,  the  technique  of  §2  may  be  applied  to  obtain  the  perturbed  Vol terra 
integral  equation 


(4.8) 


u(x)  - / V(x,  y)u(y)dy  = f(x)  + (u,JMx) 
0 


corresponding  to  (2.7),  where 


(4.9)  V(x,  y)  = - ^ [ log  - log  "7^ ] q(y)  - q(y)  log 


-V  /(l-x)(l+y) 

V (l+x)(l-y) 


and 

(4.10)  ip(x)  = - ~ q(x)  . 

The  analysis  of  §1.3  does  not  apply  directly  to  equation  (4.8),  as 
the  kernel  V(x,  y)  and  the  right-hand  side  of  the  equation  are  unbounded. 
However,  it  may  not  be  difficult  to  construct  a theory  of  equations  with 
singularities  of  this  type.  One  approach  would  be  to  consider  functions 
v(x)  such  that 

(4.11)  u(x)  = log(l  - x)  • v(x) 

is  bounded  at  x = 1,  and  derive  a regular  Volterra  integral  equation  for 
v(x).  For  numerical  work,  one  would  need  to  derive  formulas  for  numerical 
integration  which  apply  to  kernels  of  the  form  (4.9)  and  functions  (4.11) 

[5,  pp.  237-240  ] . 

§5.  Nonlinear  problems.  Attention  will  now  be  turned  to  nonlinear  boundary 
value  problems  of  the  form 

(5.1)  Lqu(x)  - F(x,  u(x))  = 0 , 

where  LQ  is  a regular  Sturm-Liouville  operator  of  the  form  (1.1),  and  one 
has  the  homogeneous  boundary  conditions  (1.3)  or  (1.4).  Taking  the  Green's 
function  G(x,  y)  of  the  operator  subject  to  the  given  boundary  conditions 

to  be  of  the  form  (1 . 9 ) 1 , equation  (5.1)  can  be  transformed  immediately  into 
the  nonlinear  integral  equation  of  Hammerstein  type 
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(5.2) 


1 


u(x)  - / G(x,  y)F(y,  u(y))dy  = 0 , 
0 


or,  using  (1.9)', 


x 1 

(5.3)  u(x)  - / 4j(x)<p(y)F(y,  u(y))dy  = <p(x)  f ^(y)F(y,  u(y))dy  . 

0 x 


X 

Adding  <p{x)  f t|j(y)F(y,  u(y))dy  to  both  sides  of  (5.3)  and  defining  the 
0 

scalar  unknown  c by 

1 

(5.  4)  c = / 4j(y)F(y,  u(y))dy 

0 

gives  the  new  problem 

x 

(5.5)  u(x)  - / V(x,  y)F(y,  u(y))dy  = c <p{x)  , 

0 

where 

(5.6)  V(x,  y)  = iJj(xMy)  - ^(x)ip(y)  . 

Equation  (5.  5)  thus  represents  a family  of  nonlinear  Volterra  integral  equations 
depending  on  the  parameter  c.  The  original  problem  (5.1)  (or  the 
Hammerstein  integral  equation  (5.  2))  is  equivalent  to  the  system  (5.4)-(5.  5). 
If,  for  example,  u(x)  satisfies  (5.  2),  then  it  will  satisfy  (5.  5)  for  the  value 
of  c given  by  (5.4).  On  the  other  hand,  suppose  that  (5.  5)  has  solutions 
U(x;c)  for  values  of  the  parameter  c which  includes  a solution  c = c'  of 
the  scalar  equation 

(5.7)  c = ¥(c)  , 
where 


1 

(5.8)  *(c)  = f ^(y)p(y,  u(y;c))dy  • 

0 

Then,  u(x)  = U(x;c  ) will  be  a solution  of  (5.2)  and  hence  of  (5.1). 
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The  structure  of  the  problem  considered  here  Is  essentially  the  same 
as  in  the  linear  case;  the  Hammerstein  integral  operator  IH  defined  by 

1 

(5.9)  (IHf)(x)  - / G(x,  y)F(y,f(y))dy 

0 

has  been  expressed  as  the  sum  of  the  Volterra  integral  operator  V with 
definition 


x 

(5.10)  (Vf)(x)  = f V(x,  y)F(y,  f(y))dy  , 

0 

and  the  operator  ® with  one-din  nnsional  range  given  by 

1 

(5.11)  (®f)(x)  = <p(x)  j i}j(y)F(y,  f(y))dy  . 

0 

However,  from  an  analytic  and  numerical  standpoint,  the  nonlinearity  of  the 
operators  V and  <S>  introduces  complications  which  are  ..ot  present  in  the 
linear  case.  One  has  to  settle  the  problems  of  existence,  uniqueness  or 
multiplicity,  and  dependence  on  the  parameter  c of  the  solutions  of  the 
nonlinear  integral  equations  (5.  5),  and  then  consider  existence  and  unique- 
ness of  the  fixed  points  of  the  scalar  operator  ^ defined  by  (5.8).  In 
the  linear  case,  the  Volterra  integral  equations  always  had  a unique  solution 
which  depends  linearly  on  the  parameter  c,  and  the  fixed  point  problem 
consisted  of  one  linear  equation  in  one  unknown.  To  get  an  idea  of  the 
complexity  involved  in  even  a simple-appearing  nonlinear  case,  one  should 
try  to  solve  (5.4)-(5.5)  with 

(5.12)  V(x,  y)  = x - y 

d2 

(which  corresponds  to  L = - — — with  the  boundary  conditions  (1.  3)),  and 

dx^ 

(5.13)  F(y,  u(y))  = u2(y)  , 

for  a solution  u(x)  which  does  not  vanish  identically. 
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In  the  case  of  numerical  solution  of  (5.  4)-(5.  5),  additional  complica- 
tions arise  from  the  fact  that  U(x;c)  and  *(c)  are  only  calculated 
approximately.  It  thus  appears  that  this  decomposition  of  the  Hammerstein 
integral  operator  does  not  lead  to  a solution  method  which  is  as  direct, 
simple,  and  effective  as  in  the  linear  case.  This  situation  immediately 
suggests  the  use  of  a linearization  technique,  such  as  Newton's  method. 
Here,  the  original  nonlinear  problem  would  be  approximated  by  a sequence 
of  linear  problems.  These  intermediate  problems  would  be  solved  by  the 
methods  developed  above  for  the  linear  case,  and  their  solutions  will 
converge  (under  certain  conditions)  to  the  solution  of  the  nonlinear  problem. 

To  be  specific,  suppose  that  uQ(x)  is  a function  which  one  may 
consider  to  be  an  approximate  solution  of  (5.1),  .in  a sense  to  be  made  more 
precise  later,  and  define 

(5.14)  FQ(x)  = Fz(x,u0(x))  , 

where  F (x,  z)  = F(x,  z),  and 
z 7 8z  ' ’ n 


(5.15)  Rq(x)  = Lquo(x)  - F(x,  uQ(x))  . 

To  obtain  the  next  approximation  u^(x)  to  u(x)  by  Newton's  method, 
one  would  solve  the  linear  problem 

(5-16)  LoVX^  " Fo^VX^  = ~VX) 

with  the  same  homogeneous  boundary  conditions  as  before  for 


(5.17) 


Pq(X)  = ux(x)  - uQ(x)  . 


In  addition  to  obtaining  the  next  approximation  u^(x)  from  the 
solution  of  (5.16),  one  may  analyze  this  linear  problem  for  satisfaction 
of  the  sufficient  conditions  for  the  convergence  of  the  sequence  of  functions 
{u  (x)},  obtained  by  Newton's  method,  to  u(x).  This  sequence  is 
constructed  by  defining 


(5.18) 


F (x)  = F (x,  u (x)) 
n zv  ’ n ’ 


Rn(x)  = L0un(x)  - F(x>  un(x))  » 


and  solving  the  linear  problems 
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(5.19) 


L r|  (x)  - F (x) r|  (x)  = -R  (x) 
0 'n  n 'n  n 


with  homogeneous  boundary  conditions  for  the  functions 
(5.20)  t^(x)  = un+1(x)  - un(x)  , 


n = 0, 1,  2, . . . . A convenient  theorem  to  use  in  this  connection  is  the 
one  due  to  L.  V.  Kantorovic  [6,9],  which  provides  sufficient  conditions 
for  the  convergence  of  the  sequence  {u^(x)}  and  error  estimates  [8,9]  for 

(5.21)  llu(x)  - u (x)  ||  = max  |u(x)  - u (x)  I 

n 0<x<l  “ 

in  the  space  of  continuous  functions  on  [0,1].  The  quantities  required  for 
the  application  of  this  theorem  are  estimates  for  IIhq  II,  a bound  for  the 
inverse  of  the  differential  operator  LQ  - Fq(x)I  (or  an  equivalent  integral 
operator),  and  a Lipschitz  constant  for  F (x,  z)  or  a bound  for  Fzz(x,  z)  in 
a suitably  defined  region  in  [0,1]  x (-«>,  +°°).  To  obtain  these  estimates, 
suppose  first  that  Lq  has  a Green's  function  of  the  form  (1.9)'.  The 
problem  (5.16)  may  then  be  transformed  into  the  perturbed  Volterra  integral 
equation 


(5.22) 


where 


(5.23) 


x 

n0(x)  ■ f Vx>  y)Ti0(y)dy  - (n0,J0)<p(x)  - -RQ(x ) , 


VQ(x,  y)  = Mx)i|i(y)  - </’(y)4J(x)  ] FQ(x)  , 


+0(x)  = 4^(X)F0(X) 


The  method  for  the  solution  of  this  problem  is  the  same  as  indicated  in 
equations  (3.2)~(3.  7).  Letting  VQ  denote  the  Volterra  integral  operator 
with  kernel  V (x,  y),  and 

(5-24)  J0  = (B  ~ VQ)_1  , 

then,  if 
(5. 25) 
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one  has 


(n0V^ 

(5.26)  r,  (x)  = -(J0R0)(x)  - ^~(V)(X)  ' 

1"(V,4'0) 


Knowing  r]  (x),  one  can  obtain  IUqH  directly.  As  an  alternative 
estimate,  (5.  26)  yields 


(5.27) 


' iu0ii  • «vir 

II  ho  II  < 

_ ii-(V-VL 

• llj0ll  • IIr0II  • 


If 

(5.28)  I VQ(x,  y)  | < m-q,  0 < y < x < 1 , 


then  one  may  use  (1.  20)  to  obtain 


(5.29) 


1 + 


li  - ( ^ - 


As  (5.  29)  holds  for  arbitrary  RQ(x),  one  has  further  that 


Mr 


(5.  30) 


Bo  = e 


1 + 


lli0ll  • II  v 
ll  - (J0*,I0) 


> II (L0  - F0(x)I) 


-1 


Now,  suppose  K = K(rQ)  > 0 exists  such  that 

f ||Fz(x,v(x))  - Fz(x,w(x))ll  < K II  v - w ||  , 

|v,  w € { U | II  u - Up  II  < rp  } , 


or 

(5.  32)  max  |F  (x,  z)  I < K,  (x,  z)  e [0,1]  X [ - ||uQ  II  - rQ,  II Uq  II  + r0l  * 
z z 

(Inequality  (5.  32)  implies  (5.  31);  however,  the  Lipschitz  condition  (5.  31) 
does  not  require  F(x,  z)  to  be  twice  differentiable  with  respect  to  z. ) 
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Once  the  above  computations  have  been  performed,  one  may  check 
the  satisfaction  of  the  hypotheses  of  the  following  theorem  on  the 
convergence  of  Newton's  method. 


(5.  33) 


Theorem  5.1  (Kantorovic) . If  (5.25)  is  satisfied,  and 

h0  = ® n0  I Bo  K - 2 


(5.  34) 


1 - VI  - 2h„ 


then  equation  (5.1)  has  a solution  u(x),  to  which  the  sequence  {u  (x)} 
defined  by  (5. 18)— ( 5.  20)  converges,  with  [ 8 ] n 


(5.  35) 


lu  - u ||  < 
n 


1 - 0 


2 VI  - 2b 


_,n  h 

2 0 


where 


! 5.  36) 


1 - VF-  2hQ 

r+'Vl  - 2hQ  • 


If  one  is  interested  also  in  the  value  of  the  "shooting  constant' 
appearing  in  (5.4)-(5.  5),  defining 


[5.  37) 


J. 

:n  = / lHy)F(y»  u (y))dy  , 


one  has  the  following  result. 


Corollary  5.1.  Under  the  hypotheses  of  Theorem  5.1,  one  ha£ 
' lc  - c | < K || 4/  fl  • ||u  - u ||  , 


(5.  38) 


where  )|u  - u ||  may  be  estimated  by  (5.  35). 
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Remark  5.1.  Using  (5.29)  and  (5.30),  inequality  (5.33)  may  be 
replaced  by  the  condition 


(5.39) 


'"oil  <-r^° 


1 + 


0II  • II V 


I 1 1IJq) 


1-2 


on  the  residue  function  RQ(x), 

For  numerical  purposes,  the  methods  of  §3  may  be  applied  to  the 
linear^problems  (5.19)  to  carry  out  the  iteration  procedure.  If  JJq <p  and 
(]J0<p,  ^q)  can  be  calculated  accurately  enough  to  guarantee  the  satisfac- 
tion of  (5.  25),  and  the  other  hypotheses  of  Theorem  5.1  can  be  verified, 
then  this  insures  that  the  original  problem  (5.1)  has  a solution  u(x). 

Any  result  obtained  numerically  can  be  treated  as  an  initial  approximation 
uQ(x)  to  u(x),  from  which  the  error  bound 

1 - n/I  - 2h 

(5.40)  II  U - U ||  < ||  T]  || 

0 ho  0 


follows  by  Theorem  5.1. 
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ABSTRACT . The  problem  of  calculating  the  dynamic  re- 
sponse produced  by  a vertical  harmonic  load  applied  to 
the  area  on  the  surface  of  a nonlinear  layered  elastic 
half  space  has  been  studied.  An  elastic  restoring  force 
which  includes  first,  third,  and  fifth  order  terms  in 
the  displacement  of  the  pavement  surface  is  required  to 
describe  the  measured  nonlinear  dynamic  response  of 
pavements  which  are  subjected  to  dynamic  loads.  The 
equation  of  motion  of  a nonlinear  oscillator  is  solved  for 
this  elastic  restoring  force,  and  the  dynamic  deflection 
of  the  pavement  surface  is  determined  as  a function  of  the 
applied  static  and  dynamic  loads.  The  mechanical  impedance 
of  a pavement,  which  is  modeled  as  a nonlinear  layered 
elastic  half  space,  Is  found  to  depend  on  the  values  of  the 
static  and  dynamic  load  applied  to  the  pavement  surface, 

1 , INTRODUCTION . Nondestructive  vibratory  testing  of 
pavements  may  be  of  importance  toward  predicting  the 
performance  of  airfield  pavements  and  may  be  used  for  the 
rapid  evaluation  of  pavement  strength,  (Ref,  1-3)  T°  be 
useful,  the  Physical  quantities  measured  by  the  non- 
destructive testinm  technlq\ie  must  be  related  to  pavement 
performance.  Pavement  performance  Is  measured  by  number 
of  aircraft  coverages  on  the  pavement  required  to  reach 
some  defined  condition  of  failure.  The  U.  S,  Army 
Engineer  Waterways  Experiment  Station  (WES)  was  requested, 
to  perform  experimental  and  theoretical  invest lmat ions  to 
determine  if  the  physical  quantities  measured  by  the  non- 
destructive technique  can  be  used  for  pavement  evaluation 
and  can  be  related  to  pavement  performance.  Some  of  the 
quantities  that  are  measurable  by  the  nondestructive 
vibratory  technique  ares 

a*  The  dynamic  deflection  of  the  pavement  surface 
versus  the  frequency  of  vibratory  loading  for 
a series  of  fixed  static  and  dynamic  loads. 

b.  The  stress  and  strain  distribution  in  the 
pavement  around  the  vibrator  measured  on 
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instrumented  pavement.  sections. 

c.  Rayleiyh  wave  dispersion  curves  p-ivin«r  phase 
velocity  versus  wavelength  measured  with  the 
wave  propagation  techniques. 

d.  The  dynamic  deflection  of  the  pavement  surface 
versus  the  dynamic  force  for  a series  of  fixed 
static  loads  and  fixed  frequencies. 

Fost  of  the  previous  work  on  the  nondestructive  testing  of 
pavements  has  treated  the  mechanical  quantities  listed  in 
Subparagraphs  a,  b,  and  c.  This  paper  concentrates 
primarily  on  the  nonlinear  response  exhibited  by  pavements 
through  the  measurements  listed  in  Subparagraph  d.  Further 
study  of  the  physical  quantities  mentioned  in  Subparagraphs 
a,  b*  anci  £.  should  be  made  in  the  light  of  the  new  results 
obtained  from  the  study  of  nonlinear  effects  in  pavements. 

The  overall  objectives  of  this  pavement  study  are: 

a.  The  development  of  a mechanical  model  which 
describes  the  measured  response  of  pavements 
to  a sinusoidal  dynamic  loading  that  is 
applied  to  the  pavement  surface. 

b.  The  development  of  a method  for  determining 
the  subsurface  structure  of  the  pavement  in 
terms  of  the  measured  dynamic  response  of 
the  pavement. 

The  development  of  the  pavement  response  model  includes 
the  following  specific  objectives: 

a.  To  determine  the  effects  of  intrinsic  pavement 
properties  and  structure  on  the  dynamic  load- 
deflection  curves. 

b.  To  determine  the  effects  of  such  vibrator 
characteristics  as  dynamic  load,  static  load, 
and  contact  area  on  the  dynamic  load  deflection 
curves, 

c.  To  calculate  the  dynamic  stiffness  and  determine 
its  dependence  on  the  intrinsic  properties  of 
the  pavement  and  on  the  characteristics  of  the 
vibrator  used  to  measure  this  quantity. 
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d , To  develop  a theory  of  the  nonlinear  dynamic 

~ response  of  pavements  which  enables  the  compar- 
ison of  dynamic  stiffness  measurements  obtained 
from  different  vibrators  at  the  same  pavement 
location. 

The  theoretical  work  done  in  this  paper  may  have 
applications  for  the  nondestructive  testing  of  roads  and 
airport  pavements.  The  possible  practical  applications 
of  this  work  are  twofold.*  (a)  the  use  of  the  dynamic 
stiffness  measurement  for  determining  the  subsurface 
structure  of  the  pavement,  i.e.,  the  shear  modulus  and 
thickness  for  each  pavement  layer,  and  (b)  the  development 
of  the  capability  of  comparing  the  values  of  the  dynamic 
stiffness  measured  by  different  vibrators  at  the  same 
pavement  location. 

To  achieve  the  objectives  listed  above,  both  theoretical 
and  exDerlmental  studies  were  conducted. 

The  theoretical  studies  Included : 

a.  The  formulation  of  a nonlinear  mechanical  model 
to  describe  the  response  of  a pavement  to  static 
and  dynamic  loading. 

b.  The  determination  of  effects  of  the  structure  of 
the  pavement-soil  system  on  the  parameters 
which  appear  in  the  nonlinear  vibration  model, 

c.  A numerical  evaluation  of  the  parameters  that 
appear  in  the  nonlinear  model, 

d.  The  development  of  formulas  giving  the  shear 
modulus  and  layer  thickness  of  each  pavement 
layer  in  terms  of  quantities  that  are  obtained 
directly  from  the  measured  dynamic  load- 
deflection  curves. 

The  experimental  studies  performed  on  both  actual  airport 
pavements  and  especially  constructed  test  sections  included: 

a.  The  measurement  of  dynamic  load-deflection 

curves  using  a vibrator  developed  at  WES  which 
can  venerate  dynamic  loads  up  to  l6  kips  (WES 
16-kip  vibrator)  with  a constant  l6-kip  static 
load  and  a constant  frequency  of  15  Hz. 
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b.  The  measurement  of  dynamic  load-deflection 
curves  at  a constant  static  load  of  l6  kips  for 
a series  of  fixed  frequencies  in  the  range  from 
10-40  Hz. 

c.  The  measurement  of  dynamic  load-deflection 
curves  at  a constant  static  load  of  16  kips  and 
a constant  frequency  of  15  Hz  for  a series  of 
baseplates  whose  diameters  ranged  from  5-18  in. 

d.  The  measurement  of  dynamic  load-deflection 
curves  at  constant  frequency  and  constant 
baseplate  size  for  a range  of  static  loads  from 
5-50  kips. 

2,  LINEAR  OSCILLATOR  MODEL  OF  PAVEMENT  RESPONSE,  Non- 
destructive vibratory  testing  of  pavements  uses  a mechanical 
vibrator  operating  at  a known  frequency  and  dynamic  force 
applied  to  the  pavement  surface  to  produce  a time-dependent 
sinusoidal  deflection  of  the  pavement  surface  directly 
beneath  the  vibrator  baseplate.  The  magnitude  of  the 
dynamic  deflection  of  the  pavement  surface  for  a series  of 
dynamic  force  levels  and  frequencies  is  considered  to  be  a 
measure  of  the  strength  of  a pavement.  This  section 
discusses  a linear  oscillator  model  used  to  describe  the 
motion  of  the  surface  of  a linear  elastic  half-space  and  then 
shows  how  this  model  fails  to  account  for  the  measured  values 
of  the  dynamic  deflection  of  the  pavement  for  a series  of 
frequencies  and  dynamic  loadings  generated  by  the  vibrator. 
The  concepts  of  dynamic  stiffness  and  deflection  are  intro- 
duced, and  the  separation  of  static  and  dynamic  displacements 
is  demonstrated. 

The  equation  of  motion  of  a mass  of  pavement  or  soil 
undergoing  vertical  oscillations  on  the  surface  of  a homo- 
geneous elastic  half-space  is 

(1)  mx  + CHx  + k^x  = Fy(t) 

where 

m = lumped  mass  of  pavement  and  soil 

x « acceleration  of  pavement  surface 

2 

C - damping  constant  ( 3 * 4avGY/g/(l  - v), 

H where  a is  the  contact  radius  of  the 
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vibrator  baseplate,  G is  the  shear  modulus 
of  the  half-space,  y is  the  density  by 
weight  of  the  half-space,  g is  the  acceler- 
ation due  to  gravity,  and  v is  Poisson's 
ratio,  Reference  4) 

x - velocity  of  Davement  surface 

k - spring  constant  (4GA/(1  -v),  Reference  4) 

H 

x = total  elastic  deflection  of  the  pavement 
surface  under  the  vibrator  baseplate 

F (t)  = total  force  applied  to  the  pavement  surface 
^ (static  plus  dynamic) 

t = time 

The  values  of  and  are  chosen  to  construct  a damped 

soring  model  for  the  vertical  vibrations  of  an  elastic  half- 
space; therefore  C represents  the  radiation  damping;  of 

H 

the  system.  If  viscous  friction  is  present,  the  value  of 
the  actual  damping;  constant  may  be  considerably  larger  than 

the  value  of  C . 

H 

The  total  force  applied  by  the  vibrator  is  written  as 

(2)  F (t)  = F + F (t) 

V s D 

where  F equals  static  load  and  F (t)  equals  dynamic 
s D 

load.  The  total  displacement  can  be  written  as 

(3)  x - x + 5 

e 

where  x^  is  the  static  elastic  deflection  of  the  pavement 

surface  beneath  the  vibrator  baseplate,  F /k  , and  r is 

s H * 

the  dynamic  elastic  deflection  of  pavement  surface  beneath 
the  vibrator  baseplate  measured  from  the  static  equilibrium 
deflection.  Combining  Equations  1,  2,  and  3 gives  the 
equation  of  motion  as 

W mg  + Ch5  + kH£  = FD(t) 
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wherein  all  static  forces  and  displacements  have  canceled. 
Therefore,  for  a linear  system,  the  static  deflection  does 
not  affect  the  dynamic  response  of  the  vibrator  mass;  only 
the  reference  point  is  changed  by  the  static  load. 

For  a sinusoidal  driving*-  force,  the  dynamic  deflection 
obtained  from  Equation  4-  is  (Reference  5) 


FD(u>)e 


i (a)t-A) 


(*5) 


K = 


2 

2 \ 2 2 

1 kn  - ma)  i + V 


= magnitude  of  the  sinusoidal  dynamic  force 
applied  to  pavement  surface 

= complex  number  notation  for  a sinusoidal 
time  dependence  where  1 = iT-1  , w = 
angular  frequency,  and  A = phase  ansrle 
between  the  dynamic  load  applied  to  the 
pavement  surface  and  the  dynamic  deflec- 
tion of  the  pavement  surface. 

Two  kinds  of  dynamic  response  curves  of  physical  Interest 
can  be  obtained  from  Equation  5* 

a.  Dynamic  deflection  versus  frequency. 

b0  Dynamic  deflection  versus  dynamic  force. 

For  a linear  system,  the  magnitude  of  the  maximum  dynamic 
deflection  is  a simple  linear  function  of  F (w)  as  shown 
in  Fierure  la.  The  magnitude  of  the  peak  3 dynamic 
deflection  as  a function  of  frequency  appears  in  Figure  IB 
for  a constant  force  vibrator  and  for  an  eccentric  mass 
vibrator  (where  the  dynamic  force  is  frequency-dependent 

2 

in  the  manner  ? ( w)  oj  ) . The  WSS  l6-kip  vibrator  is  a 

constant  force  vibrator.  Therefore,  for  a linear  system  the 
dependence  of  £ on  w is  rather  complicated,  but  the 


where : 

F (to) 
D 

i ( tot-A  1 
e 
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dependence  of  K on  F (<o)  is  given  simply  by  a straight 

line  whose  slope  is  the  dynamic  stiffness.  The  phase  angle 

A is  assumed  to  be  the  same  for  all  of  the  elements  of 
the  mass  of  pavement  and  subgrade  which  enter  into  motion 
with  the  vibrator  mass.  This  is  the  lumped  mass  assumption, 
which  requires  that  m be  interpreted  as  an  effective  mass 
which  vibrates  in  phase  with  the  vibrator  mass  and  has  a 
value  which  is  determined  by  requiring  that  the  theoretical 
frequency  resoonse  curves  agree  with  the  measured  frequency 
response  curves, 

Equation  5 shows  that  for  a linear  system,  the  dynamic 
stiffness  is  given  by 


(6) 


s = 


mw 


2 2 
CH» 


and  depends  only  on  the  following  quantities: 

a.  Frequency* 

b.  Spring  constant. 


c.  Damping  constant. 

d.  Lumped  mass  of  pavement  and  soil. 

The  elastic  parameters  of  the  pavement,  G and  v , and  the 

radius  of  the  contact  area  of  the  vibrator  with  the  pavement 

enter  the  dynamic  stiffness  through  k and  C as  seen 

H H 

from  Equation  6 and  the  expressions  for  the  spring  constant 
and  damping  constant.  For  a linear  oscillator  model,  the 
dynamic  stiffness  does  not  depend  on  the  dynamic  load  or  on 
the  equilibrium  elastic  deflection,  i.e.,  i is  a linear 
function  of  F (to  ) . However,  the  experimental  values  of 

D 

the  dynamic  stiffness  of  pavements  indicate  a strong  de- 
pendence o^  the  dynamic  stiffness  on  the  dynamic  load  and 
on  the  static  elastic  equilibrium  displacement  of  the  pave- 
ment surface.  Therefore,  the  linear  oscillator  model  is 
insufficient  to  describe  the  response  of  pavements  to  dynamic 
loadings,  and  a nonlinear  oscillator  model  is  required  to 
explain  the  experimental  data. 
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3.  THE  NONLINEAR  MECHANICAL  MODEL.  In  this  section,  a 
nonlinear  mechanical  model  is  developed  to  describe  the 
response  of  a pavement -subgrade  system  to  a sinusoidal 
dynamic  loading  applied  to  the  surface  of  the  pavement  by  a 
vibrator.  The  model  is  developed  in  three  basic  steps: 

a.  The  determination  of  the  nonlinear  pavement- 
restoring force  in  terms  of  three  parameters: 
the  linear  elastic  parameter  of  a nonlinear 
pavement,  the  third-order  nonlinear  elastic 
parameter,  and  the  fifth-order  nonlinear  elastic 
parameter. 

b»  The  solution  of  the  motion  equation  (4)  for  the 
case  of  the  nonlinear  oavement-restoring  force 
and  the  subsequent  calculation  of  the  dynamic 
stiffness  and  deflection  of  the  pavement  as  a 
function  of  the  static  and  dynamic  loads  exerted 
by  the  vibrator. 

£,  The  determination  of  the  parameters  k , b , 

and  e in  terms  of  the  elastic  constants  of  the 
layered  pavement -subgrade  system  and  in  terms  of 
the  finite  depth  of  influence  that  a static 
surface  load  produces  in  this  system. 

If  for  a fixed  frequency  the  dynamic  deflection  of  the 
Davement  surface  is  not  directly  proportional  to  the 
dynamic  force,  the  system  is  said  to  be  nonlinear.  The 
experimental  data  indicate  that  this  is  the  case  for  most 
asphaltic  concrete  (AC)  pavements  and  for  some  Portland, 
cement  concrete  (PCC)  pavements.  It  will  be  shown  that  the 
nonlinear  behavior  of  a pavement  undergoing  forced  sinusoidal 
vibrations  can  produce  very  different  values  of  dynamic 
stiffness  such  as  those  measured  at  the  same  location  by 
different  mechanical  vibrators.  Therefore,  it  is  Important 
to  be  able  to  account  for  the  nonlinear  effects  by  a simple 
physical  model, 

A physical  and  mathematical  model  for  the  nonlinear 
response  of  pavements  can  be  derived  which  will  account  for 
the  dependence  of  the  impedance  values  of  the  type  of 
vibrator  used  to  determine  then,  i.e,,  on  the  static  weight, 
dynamic  load,  and  contact  area  of  the  vibrator.  This  paper 
will  show  that  it  is  possible  to  describe  the  dependence  of 
the  measured  values  of  pavement  dynamic  stiffness  on  the 
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physical  characteristics  of  the  vibrator  by  introducing 
three  parameters  to  describe  the  nonlinear  pavement- 
restoring  force. 


The  pavement-restoring  force  is  the  force  that  the  bulk 
pavement  exerts  on  the  lumped  pavement  mass  from  below.  In 
linear  Equation  1,  the  navement-re storing  force  is  simply 

k x . In  general,  the  pavement  restoring  force  is  not 
H 

equal  to  the  force  generated  by  the  vibrator;  only  for  the 
static  case  are  these  two  forces  equal.  The  first  task 
to  be  accomplished  is  the  development  of  a mathematical 
expression  for  the  pavement-restoring  force  which  satisfies 
the  following  two  very  general  criteria t 


a.  The  mathematical  form  of  the  pavement-restoring 
force  will  be  sufficiently  general  so  that  the 
nonlinear  dynamic  response  of  the  pavement  that 
Is  calculated  from  this  restoring  force  will  be 
adequate  to  describe  the  experimental  nonlinear 
dynamic  load-deflection  curves. 

b.  Only  terms  based  on  sound  physical  theory  are 
included  in  the  mathematical  form  of  the  pavement- 
restoring force. 


The  form  of  the  nonlinear  elastic  pavement-restoring  force 
used  in  the  nonlinear  model  is  determined  by  requiring  the 
restoring  force  to  be  antisymmetric  in  the  deflection  of  the 
pavement  surface,  i.e., 


(7)  F (x)  = -F(-x) 

P P 

where  F^(x)  equals  the  pavement-restoring  force.  Equation 

7 is  satisfied  for  the  linear  case,  F - k x . A simple 

P H 

nonlinear  pavement-restoring  force  which  satisfies  Equation 
? and  which  is  found  to  be  adequate  to  describe  the  dynamic 
load-deflection  curves  for  pavements,  is 

q e; 

(8)  F^(x)  ~ ^00x  + bx  + ex  ' 

where  k equals  the  linear  elastic  parameter  of  a nonlinear 
00 


pavement  while  b and  e equal  respectively  the  third-  and 
fifth-order  nonlinear  parameters.  The  experimental  data 
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indicate  that  at  least  two  nonlinear  elastic  parameters, 
b and  e , are  required  to  describe  AC  and  PCC  pavements. 
The  linear  spring  constant  k which  appears  in  Equation 

B is  not  in  general  equal  to  the  spring  constant  k which 

H 

describes  the  homogeneous  linear  elastic  half-space. 

The  equation  of  motion  of  the  oscillating  lumped  pavement 
and  soil  mass  can  now  be  written  using  the  expression  for 
the  nonlinear  pavement-restoring  force  derived  in  the  pre- 
voius  section.  This  equation  of  motion  cannot  be  completely 
se oarated  into  static  and  dynamic  parts  as  was  the  case  for 
the  linear  elastic  system. 


The  equation  of  motion  for  the  nonlinear  spring  is  given 
by 

(9)  mx  + Cx  + k „x  + bx^  + ex J - F (t) 

00  V 

where  C is  the  damning  constant  of  the  pav*  .nt-vlbrator 

system,  and  m is  the  in-phase  lumped  mass  of  the  pavement 

and  subgrade.  The  value  of  C is  larger  than  the  value  of 

the  radiation  damping  constant  C which  appears  in 

H 


Equation  6 because  C describes  several  material  damping 
processes  in  addition  to  the  dissipation  of  energy  by 
mechanical  radiation.  Equation  9 can  be  greatly  simplified 
by  choosing  a new  origin  of  coordinates  as  in  Equation  3» 
such  that  the  motion  is  described  in  te"ms  of  coordinates 
measured  relative  to  the  static  equilibrium  deflection. 

By  itself  the  static  load  produces  a static  deflection  given 

By 

3 5 

(10)  F = k x + bx  + ex 

s 00  e e e 


Substituting  Equation  10  into  Equation  9 enables  the  equation 
of  motion  to  be  written  as 

i 3 3 \ / 

(11)  mx  + Cx  + k (x  - x ) + b|x  - x" 


00 


+ e x - x" 
\ e 


= F (t) 
D 


Using  Equation  3 and  the  following  algebraic  Identities: 
(12) 


3 3 , . 

X - X = (x  - X ) 
e e 


2 ^ 2 
X + XX  + X 

e e 


.5  _ 


2 2 


(13)  x-x7=(x-x)x  +xx  + x'x  +xx  + x 
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allows  Equation  11  to  be  rewritten  as 


(14) 


+ C£,  + k % + b?  + eE,  = F (t) 


D' 


where  k^  is  the  effective  quasi-static  spring  constant 
and  is  defined  by 

2 4 

(15)  k = k + 3bx  + $ex  + g(x  r) 

0 00  e e e 

and 


(16) 


j 2 2 

g(x  5)  = 3bx  g + lOex  5 + lOex  5 + 5ex  5 


Equation  14  is  a generalization  of  the  Duffing  Equation 
(Reference  6). 

In  this  section,  the  equation  which  determines  the 
amplitude  of  the  sinusoidal  dynamic  deflection  of  the 
pavement  surface  beneath  the  vibrator  mass  is  developed. 

The  amplitude  equation  is  expressed  in  terms  of  an  effective 
soring  constant  which  in  turn  depends  on  the  static  arid 
dynamic  deflections  of  the  pavement  surface.  The  dynamic 
stiffness  for  the  nonlinear  system  will  eventually  be 
expressed  in  terms  of  this  effective  soring  constant. 


The  functions 


k (x  l ) 
0 e 


and 


g(x  5) 
e 


are  time-dependent , 


and  therefore  Equation  14  is  very  difficult  to  solve  exactly, 
Under  special  conditions  to  be  described,  the  coefficient 
which  appears  in  Equation  14  may  be  taken  to  be  Independent 
of  time,  thereby  making  this  equation  somewhat  easier  to 
solve.  For  harmonic  motion,  the  dynamic  force  applied  to 
the  pavement  surface  by  the  vibrator  can  be  written  as 


(17) 


FD(t)  = 


icot 


(18) 


5(t)  = Aei(wt~A) 


where  A equals  the  amplitude  of  the  dynamic  deflection 
of  the  pavement  surface  directly  beneath  the  vibrator  base- 
plate, The  dynamic  deflection  of  the  lumped  mass  is  assumed 
to  be  equal  to  the  dynamic  deflection  of  the  pavement  surface, 
For  the  case  in  which  the  dynamic  deflection  amplitude  is 

much  less  than  the  static  equilibrium  deflection,  A « x , 

e 
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g(x  c)  ~ 0 can  be  used  In  Equation  15*  while  for  the  case 
e , 

4 

where  A ~ x , the  time -averaged  value  of  g(x  5)  — lOex 
e e e 

can  be  used  in  Equation  13.  For  the  two  special  cases,  the 

coefficient  k can  be  written  as 
0 


(19) 


(20) 


2 4 

k = k + 3bx  + 5ex  A <<  x 
0 00  e e e 

2 4 

k = k + 3bx  + 15ex  A ~ x 
0 00  e e « 


A simple  linear  interpolation  formula  for  k^  is  given  by 


(21) 


2 / a ' 4 

k = k + 3bx  + 5il  +2  — ex 
0 00  e x e 

\ el 


It  should  be  noted  that  the  choice  of  k^  as  time-independent 

is  an  approximation  which  becomes  invlald  for  large  dynamic 
deflections . 

Sven  with  the  coefficient  k^  assumed  to  be  time- 

independent,  Equation  14  is  a nonlinear  equation.  However, 
it  can  be  shown  that  Equation  14  can  be  cast  into  the  form 
of  an  equivalent  linear  system  whose  amplitude  equation  is 
(Reference  6) 


(22) 


2 2 i 
+ C to  I 


L ' ' j 

provided  an  effective  spring  constant 
defined  by 


fdm 

is  introduced  which  is 


(23) 


2 4 

k = k + ubA  + neA 
0 


where  k is  given  by  Equation  21,  u = 3/4  , and  n = 5/8 


The  effective  spring  constant,  k , is  seen  to  be  a function 
of  the  amplitude  of  the  dynamic  deflection  and  also  depends 
on  the  static  equilibrium  deflection  through  the  coefficient 
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k^  • If  the  static  load  applied  to  the  pavement  by  the 

vibrator  were  zero,  then  x =0,  g(x  r)  = 0 , and  k = k 

e e 0 00 

For  this  case,  there  could  be  no  coupling  of  terms  between 
x and  5 and  the  effective  spring  constant  would  depend 

only  on  the  amplitude  of  the  dynamic  deflection.  On  the 
other  hand,  if  the  dynamic  load  were  zero,  the  effective 
spring  constant  would  be  k = kQ  and  would  depend  only  on 

the  static  equilibrium  deflection. 

4,  CALCULATION  OF  THE  DYNAMIC  STIFFNESS  AND  THE  DEFLECTION 
AMPLITUDE  FOR  NONLINEAR  PAVEMENTS,  This  section  considers 
the  calculation’  of  the  dynamic  stiffness  and  the  dynamic 
deflection  of  the  pavement  surface  and  dynamic  forces 
generated  by  the  vibrator.  The  deflection  amplitude  equation 
(22)  derived  in  the  previous  section  is  expanded  in  powers 
of  the  deflection  amplitude,  A , to  give  a tenth-order 
algebraic  equation  for  the  determination  of  A , Infinite 
series  expansions  for  the  dynamic  amplitude  and  the  dynamic 
stiffness  are  obtained  as  solutions  to  this  equation. 

These  solutions  express  the  dynamic  stiffness  and  deflection 
as  functions  of  the  dynamic  load  generated  by  the  vibrator 
and  the  static  deflection  of  the  pavement  surface.  The 
static  deflection  is  then  expressed  In  terms  of  the  static 
load,  so  that  finally  the  dynamic  stiffness  and  deflection 
are  expressed  in  terms  of  the  static  and  dynamic  loads  at 
which  the  vibrator  is  operated. 


The  explicit  equation  for  the  dynamic  deflection  amplitude 
will  now  be  calculated.  The  dynamic  stiffness  of  the  pave- 
ment which  is  described  by  a nonlinear  oscillator  is  given 
by 


(24) 


S * 


k - mw 


2! 


\ 


+ C 


2 2 


where  the  effective  spring  constant  is  given  by  Equation  23 1 
The  dynamic  stiffness  depends  on  the  amplitude  of  the 
dynamic  displacement  and  the  static  equilibrium  deflection. 
The  amplitude  of  the  dynamic  deflection  is  determined  by 
Equation  22,  which  may  be  written  as 


(2*5) 


2 2 2,  v 
AS  * F U) 
D 
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Using  Equations  23  and  24,  the  amplitude  equation  (2S)  can 
be  written  as 


(26) 


2 2 

S A + 2ubi  k 
0 ■>  0 


2\  4 2 1 2 2 6 

mw  ;A  + 2n  'k  - mai  e + u b A 


8 2 2 10  2 ■ 
+ 2|iribeA  + ri  e A = i*1  ( oj  ) 

D 


where  Sq  is  the  value  of  the  dynamic  stiffness  obtained 

from  S by  taking  k = k (or  equivalently  A - 0 in 

0 

Equation  23)  and  is  defined  by  the  equations 

. , 2 / 2 12  2 2 
(27)  S = k - uni  ! + C w 

0 \ 0 / 


Whereas  the  simple  linear  system  Droduces  a linear  equation 
for  the  calculation  of  the  dynamic  displacement  in  terms  of 
the  dynamic  force,  the  nonlinear  system  appropriate  to 
describe  dynamic  pavement  response  produces  a fifth-order 

2 , v 

equation  for  calculating  A in  terms  of  F ( w)  . The 
value  5^  appearing  in  Equation  2?  is  the  dynamic  stiffness 
in  the  limit  of  zero  dynamic  loading. 

The  tenth-order  equation  (26)  will  now  be  solved  for  the 
dynamic  amplitude  A which  will  take  the  form  of  an  infinite 
series  expansion.  The  dynamic  stiffness  is  calculated  in 
terms  of  A by  Equation  25  so  that  S also  will  have  the 
form  of  an  infinite  series  expansion. 

The  solution  of  Equation  26  for  the  amplitude  of  the 
dynamic  displacement  in  terms  of  the  amplitude  of  the  dynamic 
force  is,  in  general,  difficult  to  obtain  analytically.  For 
the  case  in  which  the  dynamic  force  is  not  very  large,  the 
amplitude  of  motion  and  the  dynamic  stiffness  are  easily 
obtained  from  Equation  26  In  the  form  of 


(28) 
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(29) 

where 


I 2 

5=3  1 + 3^  + 3^  + 

0 V 2^ 


a , a = coefficient  appearing  in  the  power  series 
1 2 expansion  of  the  amplitude  of  the  dynamic 

deflection 


\p  = expansion  parameter 

coefficients  appearj 
expansion  of  the  dynamic  stiffness 


3,3  = coefficients  appearing:  in  the  power  series 

1 2 


The  values  of  <p  , , x^  » 3^  , and  3^  can  be  obtained 

by  combining  Equations  26  and  2B  with  the  following  results? 
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The  solutions  .viven  in  Equations  28  and  29  are  valid  provided 
the  dynamic  load  is  not  so  largre  as  to  prevent  the  convergence 
of  these  power  se>',»s  solutions.  Equations  28-29  have  been 
derived  from  Equation  25  and  grive  the  fundamental  description 
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of  the  nonlinear  dynamic  load-deflection  curves.  These 
equations  will  be  fitted  to  experimental  dynamic  load- 
deflection  curves. 


Equation  28  shows  that  the  amplitude  of  the  dynamic 

deflection  is  not  a linear  function  of  F but  approaches 

D 

the  linear  condition  for  small  ? or  large  S . The 

D 0 

linear  system  can  be  regained  by  setting  b = e = 0 in 
'Equations  28-34.  When  F^  = 0 , A = 0 and  S = , and  the 

dynamic  stiffness  depends  only  on  the  static  equilibrium 

deflection.  If  F = 0 , the  deflection  and  stiffness  are 

s 

given  by  Equations  28-34  with  the  provision  that  be 


replaced  by  the  constant 


The  static  load  through 


Equation  10.  Therefore,  in  general,  the  dynamic  stiffness 
of  a nonlinear  system  will  depend  on  the  magnitude  of  F^ 

and  F , The  dependence  of  S on  F_  enters  through  the 
s D 

expansion  parameter  ip  given  in  Equation  30,  while  the 

dependence  of  S on  F enters  through  the  function 

s 0 

given  by  Equation  27. 


The  expression  for  the  dynamic  amplitude  A given  in 
Equation  28  shows  that  A does  not  depend  linearly  on  F . 

D 2 

The  departure  from  linearity  is  due  to  the  terms  , 

. that  appear  in  Equation  28,  It  is  desirable  to  determine 
the  physical  quantities  which  determine  the  degree  of 
departure  from  the  linear  condition,  A - F /S  . In  the 

U 

range  of  small  F , the  predominant  term  describing  the 

D 

nonlinear  behavior  of  the  deflection  of  the  pavement  mass  is 
obtained  from  Equations  28,  30,  and  31  as  follows  * 
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In  general  the  degree  of  nonlinearity  depends  on  four 
quantities i 

a.  The  magnitude  of  the  nonlinear  parameters  b and  e . 

b.  The  relative  magnitudes  of  F„  and  F , 

S D 

c.  The  frequency  at  which  the  vibrator  is  operated, 

d.  The  static  stiffness  of  the  pavement- 

vibrator  system. 


The  oarameter  \p  , which  appears  in  the  infinite  series 
expansion  for  A and  S in  Equations  28-30,  depends 

-4 

Inversely  on  in  the  form  , and  therefore  it  follows 

that  the  dynamic  load-deflection  curves  of  stiff  pavements 
are  more  linear  than  those  of  the  more  flexible  pavements. 
Thus  concrete  pavements  are  expected  to  have  a more  linear 
resoonse  to  a dynamic  loading  than  do  the  more  flexible 

asphalt  pavements.  The  value  of  S includes  the  effects 

0 

of  the  subgrade  as  well  as  the  effects  of  each  layer  in  the 
pavement . 

It  is  clear  from  Equations  28  and  35  that  the  degree  of 
departure  from  the  linear  condition  expected  for  the  response 
of  a pavement  to  an  applied  vibratory  load  at  the  pavement 
surface  depends  on  the  frequency  at  which  the  vibrator  is 
operated.  In  particular,  it  is  apparent  from  liquation  35 
that  the  first-order  nonlinear  term  is  frequency-dependent 
and  that  this  first-order  term  will  vanish  at  a special 

2 

critical  frequency  for  which  - mw  = 0 . 

It  is  a characteristic  property  of  the  first-order  non- 
linear term  (Equation  35)  that  there  is  a critical  frequency 
for  which  this  term  vanishes;  the  critical  frequency  is 
defined  by 


(36) 


U) 
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where  u is  the  critical  angular  frequency.  In  terms  of 
c 
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the  critical  frequency,  the  first-order  nonlinear  co 
efficient  can  be  written  as 

; 2 2 1 

(37)  a = -abrn'i  a>  - w 

1 1 c I 


At  the  critical  frequency,  the  departure  from  a linear 
system  occurs  only  through  the  second-order  and  higher 

2 3 

terms  In  >p  , i.e.,  a ^ ^ + ....  Therefore  at  the 

critical  frequency,  the  pavement  response  for  small  dynamic 
loads  should  be  nearly  linear.  The  critical  frequency 
depends  on  the  vibrator  characteristics  as  well  as  on  pave- 
ment properties.  The  connection  between  the  resonance 
frequency  and  the  critical  frequency  is  obtained  from 
Equation  23  as  follows: 


(38) 


2 4 : 2 

2 2 ubA  + neA  „{  c 

R c m \ 2m j 


where  is  the  resonance  angular  frequency. 

“R  " “c  • 


In  general 


In  addition  to  a critical  frequency,  w (P  ) which 

c s 

depends  on  the  static  load  of  the  vibrator,  there  is  a 

critical  static  load  P for  each  operating  frequency  of 

sc 


the  vibrator,  which  is  defined  from  Equation  35  by 


or 


(39) 


2 

k (F  ) = na 
0 sc 


Usinp;  Equations  21,  37,  and  39,  the  first-order  coefficient 

1 can  be  written  as 
1 

(40)  a = abi  k (F  ) - k (F  ) 

1 ! 0 s 0 sc 


(41) 
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It  is  possible  to  operate  a vibrator  at  the  critical  condition 
by  adjustlr.fr  either  the  frequency  or  the  static  load  of  the 
vibrator. 


An  approximately  linear  dynamic  deflection  versus  dynamic 
force  curve  occurs  at  the  critical  frequency#  ^or  an 
arbitrary  frequency,  the  departure  from  this  approximately 
linear  curve  is  positive  or  negative  depend  in*  on  the  siyn 
of  the  parameter  x In  Equation  28  and  37.  The  sisrn  of 


the  parameter  x depends  on  the  sip:n  of  the  parameter  b 

and  whether  < w or  P < F .It  can  be  shown  that  b 

c s sc. 


is  generally  negative  for  pavements.  For  the  case  b < 0 , 
which  corresponds  to  the  case  in  which  the  shear  modulus  of 
the  half-space  is  constant  with  depth,  or  to  the  case  of  a 
layered  system  which  has  G decreasing  with  depth,  as  is 
usually  the  case  with  pavements,  the  dynamic  stiffness  and 
deflection  versus  dynamic  force  curves  are  shown  schematically 
in  s’isrure  2a.  For  the  case  b > C , which  may  be  possible 
for  the  situation  in  which  the  shear  modulus  of  a layered 
system  increases  rapidly  with  depth,  the  dynamic  stiffness 
and  displacement  versus  dynamic  force  curves  are  shown 
schematically  in  Figure  -7b.  Therefore  the  choice  of  siprn 
b < 0 has  physical  relevance  to  pavement  problems.  For  the 
choice  b < 0 , the  sisrn  of  the  parameter  x can  be  positive 


or  negative  depending  on  whether  w < u>  or  oj  > u> 

c c 


respectively.  The  value  of  u can  be  determined  by  observ 

c 


in«r  the  frequency  which  produces  the  most  linear  load- 
deflection  curve.  The  alcrebraic  sisms  of  b and  e can  be 
determined  from  the  manner  in  which  the  dynamic  load-deflection 
curves  bend  away  (as  in  Figures  2a  and  2b)  from  the  approx- 
imately linear  load-deflection  curve  which  occurs  at  oj  = oj  * 

c 

It  should  be  pointed  out  that  the  definition  of  dynamic  stiff- 
ness is  the  ratio  of  the  dynamic  load  to  the  dynamic  deflection 
for  each  point  on  the  dynamic  load-deflection  curve. 


The  explicit  dependence  of  the  dynamic  stiffness  on  F^  is 

civen  by  Equations  28-3^.  These  equations  will  also  stive 

the  explicit  dependence  of  S on  F , provided  that  the  static 

s 
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equilibrium  displacement  x is  expressed  explicity  as  a 

e 

function  of  F by  using  Equation  10.  Because  Equation  10 
s 

is  an  equation  of  fifth  decree,  numerical  methods  are 

generally  required  for  its  solution.  However,  in  the  extreme 

of  very  large  and  very  small  values  of  F , analytical 

s 

solutions  of  this  equation  are  possible.  For  a very  small 
static  load,  the  equilibrium  elastic  displacement  is  given  by 


(42) 

wor  somewhat  larger 
itself  and  x may 


F 


values  of  F 


the  cubic  term  manifests 


s 

be  obtained  from  the  approximate  equations 


(43) 


The  discriminant  of  this  cubic  equation  is 


(44) 


2 3 

D = Is  + Vl 

2 3 

4b  27b 


and  is  negative  for  small  F when  b < 0 . For  the 

s 

condition  D < 0 , the  solution  of  the  cubic  equation  can 
be  written  as  (Reference  7) 


(4*5) 
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where  0 is  the  ancle  which  apoears  in  the  solution  of  this 
cubic  equation.  In  the  limit  of  small  f^  (or  small  b) , 


the  cosine  term  in  Equation  45  has  the  value 


( 4? ) 
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CombininF  Equations  4 5 and  4y  Fives 
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The  solutions  of  Equations  45  and.  48  have  been  derived  for 
b <0  and  are  therefore  applicable  to  pavements.  It  can 
be  shown  that  Equation  48  is  also  valid  for  b > 0 . 


With  IncreasinF  values  of  17 

5 


the  fifth-order  terms 


become  dominant, 

for  x is 
e 


(49)  x = 

e 


and  in  this  region  t ie  approximate 
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solution 


Equation  10  is  easily  solved  for  the  yeneral  case  of  an 

arbitrary  value  of  ? by  uslnp  a digital  computer.  A 

s 

schematic  praph  of  x versus  F for  pavements  (b  < 0) 

e s 

is  Fiven  in  PlFure  3a  while  the  correspondinF  Fraph  for  a 
b > 0 formation  is  yiven  in  Fiyure  3b. 

The  sprinF  constant  k Fiven  by  Equations  19-21  has  a 

.1  0 

conventional  interpretation  only  when  the  dynamic  deflection 

amplitude  satisfies  A <<  x . FQr  a / 0 the  sprinF 

e 
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constant  has  the  approximate  value  ^iven  by  Equation  21. 
Usinjr  Equations  19#  48,  and.  49,  the  value  of  k for  zero 

0 

dynamic  arcolitude  and  for  small  F is 

s 
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^or  very  small  F , Equation  50  can  be  rewritten  as 


(52) 
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while  for  very  larpre  F , Equation  51  can  be  rewritten  as 

s 


(53) 


k = Se 
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\ e 


Equations  10  and  21  are  easily  solved  simultaneously  on  a 
digital  computer  to  p-ive  the  general  solution,  k = k (F  ). 

0 0 s 

Figures  4a  and  b show  respectively  the  dependence  of  k on 

0 
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F for  pavements  (to  < 0)  and  for  b > 0 formations.  For 
s 

the  case  b < 0 , the  function  k exhibits  a local  minimum 

0 

value  for  some  value  of  F (or  x ). 

s e 

The  dynamic  stiffness  defined  by  Equation  2^  depends  on 
the  damping  constant  C of  the  pavement  as  well  as  on  the 
effective  soring  constant  k , This  damping  constant  is 
not  in  general  equal  to  the  damping  constant  for  the  homo- 
geneous linear  elastic  half-space,  C , that  was  defined  in 

H 

the  damping  constant  expression,  A theoretical  calculation 
of  C was  not  made  in  this  paper;  however,  the  nonlinear 
elastic  nature  of  flexible  pavements  gives  rise  to  a simple 
method  of  estimating  the  value  of  the  damping  constant. 
Equations  23*  and  show  that  when  <o  ~ ^c*  the  dynamic 

stiffness  has  the  critical  value; 


( 54)  S — -W  |ybA^  + peA^ : + 


* V 20 W 

Therefore  a measure  of  the  damping  constant  can  be  determined 
directly  from  the  dynamic  load  versus  deflection  curves  by 
measuring  the  critical  value  of  the  dynamic  stiffness.  An 
approximation  to  the  value  of  the  damping  constant  is  thus 
given  by 


(5*0  c = — 

"V  2a)c 


3,  SUMMARY,  A nonlinear  mechanical  model  describing  the 
dynamic  properties  of  a pavement-vibrator  system  has  been 
developed  which  describes  the  nonlinear  dynamic  response  of 
a pavement  to  a sinusoidal  loading  applied  to  the  pavement 
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surface.  Theoretical  expressions  are  developed  for  the 
dynamic  stiffness  of  a pavement  measured  by  a mechanical 
vibrator  which  are  expressed  in  terms  of  the  static  load, 
dynamic  load,  and  vibrator  baseplate  size  and  in  terms  of 
the  linear  and  nonlinear  pavement  parameters.  The  non- 
linear mechanical  model  gives  an  analytical  correlation 
among  the  values  of  the  dynamic  stiffness  measured  by 
different  vibrators  at  the  same  pavement  location.  Ex- 
perimental tests  were  done  to  determine  the  validity  of  the 
theoretical  pavement  response  model.  The  experimental  and 
theoretical  results  are  in  good  agreement  (Reference  8). 

The  nonlinear  mechanical  model  developed  in  this  paper 
vives  the  following  conclusions: 

a.  Third-  and  fifth-order  nonlinear  terms  in  the 
displacement  are  required  to  describe  the 
dynamic  load-deflection  response  of  actual 
pavement  systems. 

b.  The  theoretical  nonlinear  oscillator  model  of 
pavement  response  to  a dynamic  loading  shows 
that  stiff  pavements  have  a more  linear  dynamic 
load-deflection  curve  than  flexible  pavements. 

£.  At  specific  critical  frequencies  the  dynamic 
load-deflection  curves  become  essentially 
linear  at  low  values  of  the  dynamic  force. 
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DYNAMIC  DEFLECTION,  IN. 


NOTE;  CURVES  ARE  SKETCHES;  VALUES 
ARE  NOT  CALCULATED. 


STATIC  DISPLACEMENT,  IN.  STATIC  DISPLACEMENT,  IN. 


0,02 


STATIC  LOAD,  KIPS 
a.  CASE  b < 0 


0 5 10  15  20  25  30  35 

STATIC  LOAD,  KIPS 
CASE  b > 0 

Figure  3.  Theoretical  static  load-deflection  curves  predicted  hy 
the  nonlinear  spring  model* 
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EFFECTIVE  STATIC  SPRING  CONSTANT  ^ , 103  KIP5/IN. 


STATIC  LOAD,  KIPS 


o.  CASE  b < 0 (PAVEMENTS) 


STATIC  LOAD,  KIPS 
b.  CASE  b > 0 (SOILS) 


Figure  4,  Theoretical  dependence  of  the  effective  static  spring 
constant  on  the  static  load. 
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Application  of  the  Theory  of  Slender  Curved  Rods 
to  the  Analysis  of  Elastic  Yarns 
N . C . Huang 

The  Mathematics  Research  Center 
University  of  Wisconsin-Madison 

Abstract 

A systematic  formulation  of.  the  linear  theory  of  elastic  slender  curved 
rods  is  presented.  First,  equations  of  equilibrium  for  stress  resultants 
and  bending  and  twisting  moments  are  derived  utilizing  Serret-Frenet 
formula.  The  generalized  strains  are  then  defined  according  to  the  principle 
of  virtual  work.  Constitutive  equations  are  obtained  based  on  the  Euler- 
Bernoulli -Saint  Venant  assumptions.  The  theory  of  slender  curved  rods 
thus  formulated  is  applied  to  the  analysis  of  the  extensional  deformation 
of  elastic  two-piy  filament  yams  and  continuous  filament  yarns,  with  the 
filaments  in  the  yarns  treated  as  slender  curved  rods.  In  comparison  with 
previous  works  on  the  stress  analysis  of  yarns,  it  is  found  that  the  approach 
adopted  here  has  the  advantage  of  fewer  assumptions  and  hence  could  provide 
more  accurate  results  and  better  geometrical  and  physical  insights  into  the 
problem.  In  this  study,  the  yarn  elongation  and  filament  stresses  are 
determined  for  yarns  with  various  helical  angles.  The  effect  of  superposition 
of  a twisting  moment  on  i.he  axial  extension  is  also  investigated. 


Sponsored  by  the  United  States  Army  under  Contract  No.  DA-31-124-ARO-D-462. 
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1 . Introduction 


The  theory  of  the  extension  of  elastic  filament  yarns  has  been 
investigated  by  a number  of  authors  [ 1-4].  For  purpose  of  simplifying 
the  analysis,  assumptions  and  approximations  are  usually  imposed. 

Wilson  and  Treloar,  in  their  study  of  the  two-ply  filament  yarns,  employed 
the  theory  of  the  finite  deformation  of  a straight  circular  cylindrical 
rubber  rod  subject  to  combined  axial  extension  and  torsion  [ l ] . As  the 
filaments  in  a twisted  yarn  are  in  reality  not  straight,  corrections  had  to  be 
made  in  Wilson  and  Treloar1  s analysis  for  the  effects  of  noncircular  cross  sec- 
tion and  lateral  pressure  between  filaments.  In  the  studies  on  the  extension 
of  continuous  filament  yarns  with  circular  cross  sections  [ 2-4],  the 
following  assumptions  are  imposed,  (l)  The  cross  section  of  each  fiber 
is  assumed  to  be  infinitesimal  and  the  configuration  of  each  fiber  is 
considered  to  be  perfectly  helical  with  constant  radius  and  the  same 
number  of  turns  per  unit  length  of  the  yarn  axis.  (2)  The  fractional 
contraction  in  yam  diameter  is  assumed  to  be  uniformly  distributed  across 
the  yam.  (3)  The  shear  forces  and  moments  acting  on  all  faces  of  the 
yam  element  are  neglected.  (4)  The  stresses  at  any  point  in  the  yarn 
are  assumed  to  be  constant  in  all  directions  at  right  angles  to  the  fiber 
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axis.  Assumptions  (2)-(4)  are  employed  for  convenience  of  analysis. 

Their  validity  is,  however,  uncertain. 

In  this  paper,  we  shall  establish  a more  accurate  theory  for  the 
extension  of  long  elastic  filament  yarns.  The  filaments  in  the  yam  are 
treated  as  elastic  slender  curved  rods  with  the  center  line  of  the  rod 
travelling  along  a circular  helical  path.  Stress  analyses  of  yarns  are 
made  based  on  the  theory  of  slender  curved  rods  [ 5] . In  order  to  give  a 
comprehensive  presentation,  a survey  of  the  formulation  of  the  linear 
theory  of  elastic  slender  curved  rods  is  first  given.  The  rod  is  treated 
as  a one-dimensional  body.  Equations  of  equilibrium  are  derived  utilizing  the 
Serret-Frenet  formula.  The  strain  measures  are  defined  by  the  principle 
of  virtual  work  in  a manner  similar  to  that  used  by  Sanders  in  his  formula- 
tion of  nonlinear  theories  of  shells  [ 6],  Constitutive  equations  are 
derived  based  on  the  Euler- Bernoulli -Saint  Venant  assumptions. 

The  linear  theory  of  slender  curved  rods  thus  formulated  has  been 
applied  to  the  investigation  of  the  small  extension  of  two-ply  filament 
yarns  [7]  and  continuous  filament  yarns  [8].  A survey  of  these  studies 
is  presented  in  this  paper.  The  yarn  elongation  and  filament  stresses 
are  analyzed  for  yams  with  various  geometries.  The  effect  of  super- 
position of  a twisting  moment  on  the  axial  extension  is  also  discussed. 

In  comparing  our  investigation  with  previous  works  [ 1-4],  it  is  found  that 
the  analysis  presented  here  has  the  advantage  of  the  relaxation  of  assump- 
tions and  the  elimination  of  approximations.  Also,  it  is  found  that  the 
approach  adopted  here  can  provide  better  geometrical  and  physical  insights 
into  the  yarn  problems. 
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2.  Linear  Theory  of  Slender  Curved  Rods 


Let  us  consider  a slender  curved  rod.  The  rectangular  Cartesian 
coordinates  of  any  point  on  the  center  line  of  the  rod  are 

X.  = x.(s)  , i = 1,2,  3,  (2.1) 

where  s is  the  arc  length  measured  along  the  center  line  of  the  rod 
from  a fixed  point.  The  infinitesimal  arc  length  ds  satisfies 

ds2  = dx.dx.  , (2.2) 

l l ’ 

where  the  repeated  indices  represent  summation.  Let  us  denote  the 

unit  tangential,  principal  normal  and  binormal  vectors  by  |i,  and 

v ^ respectively  and  the  principal  normal  curvature  and  the  torsion  of 

the  center  line  by  « and  t respectively.  Hence 

= x!  , (2.  3) 

where  prime  represents  the  differentiation  with  respect  to  s.  By 

Serret-Frenet  formula,  we  have 

\|  = kji.  , (2.4) 

P.  = Tvi  ~ k\.  , (2. 5) 

vj  = -Tp.  . (2.6) 

In  the  following,  we  shall  consider  only  the  small  deformation  of  the  rod. 

Therefore  we  can  use  the  undeformed  rod  as  our  state  of  reference. 

Let  us  denote  the  components  in  the  u.  and  v . directions 

v l i 

by  the  subscripts  p and  v respectively.  The  distributed  force 
and  the  distributed  moment  m.  per  unit  length  of  the  rod  can  be 
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written  as 


P.  - P,  V.  + p p,  + p v.  , 
1 X l (I  I v 1 ? 


m.  = mV.  + m p.  + m v. 
i X i p i v l 


(2.7) 

(2.8) 


The  stress  resultant  F.  and  the  moment  M.  acting  at  any  cross  section 

l l 


of  the  rod  can  be  expressed  as 


F,  = F X + F p.  + F v.  , (2.9) 

i X i p i v i 

M.  = M.  X.  + M p.  + M v . * (2.10) 

l X l |i  l v l 

Also,  the  displacement  vector  IF  at  any  point  on  the  center  line  of 
the  rod  can  be  written  as 


U.  s U\.  + U U.  +U  v,  . ( 2 . 11) 

i X i pi  v i 

Let  us  consider  an  element  of  the  rod  of  length  ds.  The  conditions 
of  equilibrium  of  all  forces  and  moments  acting  on  the  element  lead  to 
the  following  equations: 

F!  + p.  = 0 . (2.12) 

l l ’ 

Mf  + e...XJF,  +m,  = 0 . (2.13) 

i ij  k j k i 7 

where  ^ is  the  permutation  symbol.  After  substitutions  of  equations 
(2.  7)-(2.1 1 ) into  equations  (2.12)  and  (2.13)  and  utilization  of  equations 
(2.4)~(2.6),  we  obtain  the  following  equations  of  equilibrium  of  forces 

F1-*V\  = 0' 

F'  + kF  - tF  + p = 0 , 

P X v p 

F'  + tF  + p = 0 , 
v p v 


221 


(2.14) 

(2.15) 

(2.16) 


and  the  following  equations  of  equilibrium  of  moments 


M' 

- kM 

+ m = 0 , 

(2.17) 

\ 

P 

X 

M' 

+ kM 

- tM  - F + m - 0 , 

(2.18) 

P 

X 

v v p 

M' 

+ tM 

+ F + m = 0 . 

(2.19) 

V P p V 


Let  6u  , 6u  7 6u  be  the  components  of  virtual  displacement  at 
X p v 

any  point  on  the  center  line  of  the  rod  and  b<p  , 6 <p  and  &<p  the 

a.'  p v 

components  of  virtual  rotation  of  any  cross  section.  By  equations  of 
equilibrium  (2. 14)  — (2. 19),  we  have 

L 

f [ 6u  (F'  - «F  + p ) + 6u  (F1  + kF  - tF  + p ) + 6u  (F*  + tF  + P ) 
q X X p X P M-  X.  v p v v p v 

+ 6 q>  (M1  - kM  + m ) + b<p  (M*  +kM  -tM  ~F  +m  ) 

XX  p X p p X.  v v p 

+ b<p  (M1  +tM  +F  +m  )]ds  - 0 , (2.20) 

V V p p V 

where  L is  the  total  length  of  the  rod.  After  integration  by  p^rts, 
equations  (2.20)  can  be  written  as 


f (p  6u  +p  6u  +p  6u  +m  f>©  +m  6 <p  +m  6 w )ds 

J0  \ X h*.  p v v x \ p p V V 

. i s = L 

+ (F  6u  + F 5u  + F 6u  +M6q  + M 6i p + M 6 q ) . 

XX  p p v v X X pp  v v S = 0 

L 

= f [F  6(u*  - kU  ) + F 6(u*  +kU  - tu  -q  )+F  6(u'  + tu  +<p  ) 
•i  XX  p p p X vvvv  PP 


+ M S(<p'  ~ ) + M 6(<p’  - T<p  +[<<p  ) + M 6(<p'  + t <p  )]ds  . 

XX  p ppv  X v v p 


(2.21) 
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The  left-hand  side  of  equation  (2.  21)  can  be  identified  as  the  external 
virtual  work.  Hence,  by  the  principle  of  virtual  work,  the  right-hand 
side  of  equation  (2.  21)  is  the  internal  virtual  work.  From  equation  (2.  21), 
we  may  define  the  following  strain  measures  for  the  slender  curved  rod: 

eX  = UX  “ KU|i  ’ (2'22) 

V = + KUX  - Tuv  - *v  > (2l23) 

V = u'  + TU  + (p  , (2. 24) 

V V fi,  |JL 

6 = <P[  ~ k<o  » (2*  25) 

K = <p*  - Tcp  + k<p.  , (2.26) 

p p v X, 

K = cpl  + Tw  . (2.27) 

V V (JL 

We  shall  call  the  axial  strain,  y and  the  components  of 

transverse  shearing  strain,  9 the  twisting  strain  and  K and  K the 

M*  ^ 

components  of  bending  strain. 

Based  on  Euler- Bernoulli -Saint  Venant  assumptions,  the  displace- 
ment components  at  any  point  in  the  rod  can  be  derived  by  the  super- 
position of  deformations  due  to  the  displacement  components  at  the 
center  line  of  the  rod,  the  components  of  rotation  of  the  cross  section 
and  the  warping  of  the  cross  section  introduced  by  the  twist  of  the 
rod.  Let  us  consider  any  cross  section  of  the  rod.  Set  the  origin  at. 
the  center  of  the  cross  section  and  two  coordinate  axes  in  the  p^  and  v 
directions.  The  coordinates  of  any  point  in  the  cross  section  are  denoted 
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by  p and  v.  Let  i , i and  i be  unit  vectors  in  the  tangential, 
r s’  p v ’ 

principal  normal  and  binormal  directions  respectively.  The  displacement 
vector  at  any  point  in  the  rod  is  given  as 

u = (u  - tup  - v(p  - 0ii/)i  + (u  - )i  + (u  + , (2.28) 

\ v p ' S p A.  p v K v 

where  ^ = i|j(s,  p,  v)  is  a warping  function  associated  with  the  twist 
of  the  rod.  It  can  be  shown  that  when  the  dimension  of  the  cross  section 
of  the  rod  is  small  in  comparison  with  both  1/k  and  1/t,  the  non- 
vanishing components  of  strain  at  any  point  in  the  rod  can  be  expressed 
as  [ 5] 


e 


£ 


e 


ss 

Sp 

SV 


H + vKp  ‘ ^ + *e' + [t(v  l;  ~ * av} + ^,]e  ’ 

2 [vp  “ V0  +(9^  + Kdj)0]  ’ 


(2.29) 


(2. 30) 

(2.31) 


The  material  of  the  rod  is  considered  to  be  linearly  elastic.  The  stress 


si 


components  a , tr  and  o-  are  related  to  the  strain  components 
ss  sp 

e , e and  e by 

ss  sp  SV 


Ee 


cr  -*  * 

SS  ss 


(T  = 2Ge  . 

s p s p 

<r  = 2Ga  , 

sv  s v 


(2. 32) 
(2.  33) 
(2.  34) 


where  E and  G are  Young's  modulus  and  shear  modulus  respectively. 
The  cross  section  of  the  rod  is  assumed  to  be  doubly  symmetrical  with 
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respect  to  the  coordinate  axes.  Hence,  the  warping  function  iJj  is 
an  odd  function  of  p and  v.  The  constitutive  equations  can  be 


obtained  by  the  following  area  integrals  extended  over  the  cross  section 
of  the  rod: 


fa  dA  = 
A 88 

EAs^  + EDt0  , 

(2.35) 

F = 

fa  dA  = 

GAy  , 

(2. 36) 

F 

A SfJ' 

P 

F = 

/ <r  dA  = 

GAy  , 

(2. 37) 

* V 

A SV 

I ( H-17 

A £V 

vu  )dA  = GJ0  , 
Sfl  ? 

(2. 38) 

M = 

f vo-  dA 

~ El  K , 

(2. 39) 

M- 

, ss 
A 

F F 

M - 

j po-  dA 

= El  K , 

(2.40) 

V 

ss 

A 

V V 

r 2 r 2 

where  A is  the  cross-sectional  area,  I = J v dA,  I - J p dA  are 

K A v A 


moments  of  inertia  of  the  cross  section  about  the  p and  v axes 
respectively, 


I = /(^-  v S+  * + v^dA  <2-41) 

A r 

is  the  torsional  constant  of  the  cross  section  and  D = I + I - J. 

p v 

Equations  (2 . 14)— ( 2 * 19) , (2.  2 2) -(2, 27)  and  (2.  3 5) -(2.  40)  are  the 
governing  equations  of  the  small  deformation  of  slender  curved  rods.  The 
boundary  conditions  can  be  obtained  from  equation  (2.21).  We  find  that  at 
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both  ends  of  the  rod,  it  is  necessary  to  prescribe  (1)  or  u^, 

(2)  F or  u , (3)  F or  u , (4)  M or  a>  , (5)  M or  <p  , 

V ' )JL  (JL7  ' V V*  \ \ fJL  (J. 

(6)  M or  0 . We  shall  employ  these  field  equations  to  analyze  the 
' v v 

deformation  of  elastic  yarns  subject  to  axial  extension  in  the  following 
sections. 


3 . Extension  of  Elastic  Two-Ply  Filament  Yarns 

Let  us  consider  a two-ply  filament  yarn  subject  to  an  axial  force 
F and  a twisting  moment  M.  We  consider  M to  be  positive  if  it  is 
in  the  direction  of  the  twist  of  the  yarn.  The  cross  section  of  each  fila- 
ment in  the  undeformed  state  is  assumed  to  be  circular  with  radius  a. 
The  center  line  of  the  filament  is  regarded  as  helical  with  the  length  of 
one  turn  of  the  twist  measured  along  the  axis  of  the  yarn  as  h = 2irk 
where  k is  a constant.  Let  us  denote  the  distance  measured  from  the 
yarn  axis  to  any  point  on  the  center  line  of  the  filament  by  r and  the 
helical  angle  of  the  center  line  by  0.  The  principal  normal  curvature 
and  the  torsion  of  the  center  line  of  the  filament  are  found  to  be 


1 . „ 1 
« — sm  G,  t = ~cos  0 , 

P P 

2 2 f -1  r 

where  p = (r  + k ')*  and  0 = tan  ~ . The  unit  vector  \ makes 
an  angle  0 with  the  yarn  axis.  The  unit  vector  p is  in  the  radial 
direction  and  toward  the  yarn  axis.  Hence  the  angle  between  p and 
the  yarn  axis  is  tt/2. 


(3.1) 
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The  filament  is  considered  to  be  a one-dimensional  body  of 
infinite  length.  We  shall  apply  the  field  equations  of  slender  curved 
rods  to  analyze  the  deformation  of  the  filament.  In  the  equations  of 
equilibrium  of  our  problem,  the  derivatives  of  the  stress  resultant  and 
moment  with  respect  to  the  arc  length  of  the  filament  must  vanish.  Hence, 
by  equations  (2. 14)— ( 2 . 19),  it  is  found  that 


V " = 0 ’ 

(3.2) 

v ■ V + p,i = 0 ’ 

(3.3) 

F t + p = 0 , 
P h v 7 

(3.4) 

M k - m =0 

p A.  7 

(3.  5) 

M « - M t - F +m  = 0 

\ V V \1 

(3.6) 

Mt  + F +m  = 0. 
F M- 

(3.7) 

The  strain-displacement  relations  are  given  by  equations  (2.  22)  — (2 - 27) 

and  the  constitutive  relations  are  given  by  equations  (2.  35)-(2.40)  with 

2 77  4 it  4 

A = ira  , I = I = 1 = ~ a , J = “ a and  D = 0. 

p v 4 2 

The  overall  equilibrium  of  the  internal  forces  and  the  applied  force 
F requires  that 

F = 2(F  cos  9 + F sin  6)  . (3.8) 

\ V 

Similarly,  the  overall  equilibrium  in  moments  requires  that 

M - 2(M,  cos  0 + M sin  9 + F,  r sin  0 - F r cos  0)  . (3.9) 

A.  V \ V 

Since  p is  in  the  radial  direction  and  the  yarn  is  long,  the 

components  u and  <p  must  be  independent  of  the  arc  length  of  the 
P P 


227 


filament.  If  we  set  the  origin  s = 0 at  the  mid- point  of  the  entire 
yarn,  we  may  write 

~ C1S’  \ “ C2S’  " C3S’  ^ v ~ C4S  ’ (3.10) 

where  c,,  c.,  c.,  and  c.  are  constants  to  be  determined.  After 
1’  2'  3 4 

substitutions,  we  obtain 


F.  = EA(c,  - u k)  , 

\ 1 p 

(3.11) 

F = GA(c  k - c t - c Js  , 
p ' 1 2 4 * 

(3.12) 

F = GA(c_  + u t + ^ ) 
v 2 p p 

(3.13) 

M = GJ(c  - <p  k)  , 

A.  |1 

(3.14) 

M = EI(c3k  - c4t)s  , 

(3.15) 

M = EI(c , + <p  r)  „ 
v v 4 pi 

(3.16) 

Since  the  yarn  is  assumed  to  be  long,  p and  m are  constant. 

\ K 

From  equations  (3.2)  and  (3.  5),  it  is  seen  that  F and  M are  also 

P P 

constant.  However,  equations  (3.12)  and  (3.15)  indicate  that  F and 

M are  proportional  to  s.  Therefore,  we  conclude  that  F = M =0. 
P P P 


From  equations  ( 

3.12)  and  (3.15),  we  find  that 

T 

(3.17) 

°3  = 7<C1K  ■ V>  ' 

°4  = OjK  - CZT  , 

(3.18) 

and  from  equations  (3.2),  (3.4),  (3.5)  and  (3.7),  we  find  that 

p.  = p = m.  = m = 0.  Hence  p is  the  only  nonvanishing  contact 

K v K v p. 

pressure  between  the  filaments,  i.e.  , the  common  normal  to  the  line  of 
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contact  of  the  two  filaments  is  in  the  radial  direction.  We  conclude 
that  the  line  of  contact  must  coincide  with  the  yarn  axis  and  the  two 
filaments  wind  about  the  yam  axis  following  helical  paths  as  shown 
in  Figure  1.  Accordingly,  we  have 

r = a,  p = a esc  0 . (3.19) 

In  the  deformed  state,  the  filaments  contact  by  a curved  surface. 

The  analysis  of  this  type  of  contact  problem  is  difficult.  In  our  analysis, 


we  assume  that  the  contribution  to  u due  to  the  elastic  contact  of 

P 

filaments  is  proportional  to  p . Hence,  we  have 

P 

era  „ y 

\ \ ' % ’ 


(3.  20) 


where  cr  is  Poisson's  ratio  and  £,  is  a positive  constant.  The  first 

term  in  equation  (3.  20)  is  due  to  the  contraction  in  diameter  of  the 

filament  through  Poisson's  effect.  The  distributed  moment  m can  be 

P 

interpreted  as  the  moment  caused  by  friction  due  to  the  relative  motion 

of  the  filaments.  The  magnitude  of  m depends  on  the  width  of  the 

P 

contact  surface.  Under  the  assumption  of  small  extension,  we  may  set 


m = 0. 
P 


After  substitutions,  equations  (3.  3)-(3.  6)  can  be  written  as 

+ i K'q - GAtvv - tv0’  (3-21) 

M , k - M T - GAy  = o . (3.22) 

\ V V 
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Let  u and  u be  the  displacement  components  of  the  filament 
0 z 

in  the  tangential  and  axial  directions  in  a cylindrical  polar  coordinates 
system.  It  is  found  that 


= ~(ru.  - ku  ) = 
p X.  v 

(CjK  - C2T)S  , 

(3.23) 

= — (kuN  + ru  ) = 
p X v 

(CjT  + C_,k)s  . 

(3.24) 

Hence  the  rotation  of  the  cross  section  of  the  yarn  about  its  axis  is 


w = 


„ p. 


2 r v 1 


(c.k  - c,t)s 


(3.25) 


Let  us  denote  the  extension  and  torsion  of  the  yarn  per  unit  length  by 
u and  w respectively.  We  have 

u = Cj  + c2  tan  0 , 

* 1 

w = ~ (Cj  tan  0 - c2)  • 


(3.26) 

(3.27) 


By  equations  (3. 17),  (3.18),  (3.  26)  and  (3,  27),  we  can  express  c^,  c^, 

c and  in  terms  of  u and  co  * By  equations  ( 3- 11),  (3,13),  (3-14), 

(3.16),  (2.  37)  and  (2.  24),  we  are  able  to  express  F F , M and  M 

K'  v K v 

# * 

as  linear  functions  of  u , « , u^  and  y^  with  coefficients  dependent 

on  0.  A set  of  simultaneous  linear  equations  of  u , u , u and  y 

’ ’ p,  v 

can  then  be  derived  from  equations  (3.21),  (3.22),  (3.8)  and  (3.9)-  Let 


us  introduce  the  following  dimensionless  quantities: 

Q = k/a,  i - E£,  u = 2TraEu  /F,  U*  = 2ttq2EuVf,  V*  = 27ra3EwV/F  , 

M* 


Y*  = 2-rra2Ey  /F,  m = H/(Fa),  f = 2F,  /F  . 

V \ 


(3.28) 
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The  helical  angle  e is  related  to  the  pitch  parameter  a by  0 = cot  a. 

Our  governing  equations  can  be  written  in  a form  of  AY  = Z where 

IN/  <V 

'Jt  jjs  ^ 

Y = [U  V uy  ] , Z = [001m]  and  A is  a four  by  four  matrix  whose 

#V  <V  /V 

elements  depend  on  0. 

5jc  % 

For  any  given  values  of  or,  a,  £ and  m,  we  can  find  U , V , 

ll< 

V 

u and  -y  by  solving  simultaneous  equations.  The  axial  force  carried 

by  each  filament  can  be  found  from  equation  (3.11)  as 

^ 2 « 2 
f = U cos  0 + V sin  0 cos  0 - u cos  0 . (3.29) 

In  our  study,  we  adopt  <r  = 0.4  and  £ = 1.07.  Two  problems 

are  being  considered  here.  In  the  first  problem,  the  ends  of  the  yarn 

are  clamped.  This  corresponds  to  the  case  of  tension  test  of  yarns.  In 

this  case,  V = 0 and  U , u,  y and  m are  unknown.  The 

calculated  values  of  U , m and  f are  plotted  as  functions  of  a in 

Figure  2 by  the  solid  lines.  Note  that  when  a approaches  infinity, 

both  U and  f approach  a limit  one.  When  a is  large,  the  filament 

are  nearly  straight.  "When  a is  small,  the  elongation  of  the  yarn  is 

essentially  governed  lay  the  change  of  the  helical  angle.  Between  these 

two  limiting  cases,  the  value  of  m may  reach  a maximum  value  as 

shown  in  Figure  2. 

In  the  second  problem,  the  ends  of  the  yarn  are  free  to  rotate.  This 
case  occurs  when  a vertical  yarn  is  fixed  at  the  upper  end  and  extended 
by  a weight  attached  to  the  lower  end.  In  this  problem,  m = 0 and 
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m.f  and  m Versus  a 
’ c 


Curves  for 


0.4  and  f;  =■  1.07. 


y(c  ^ 

U , V , u and  y are  unknown.  The  calculated  values  of  U , V 
end  f are  plotted  against  a in  Figure  2 by  the  dotted  lines.  The 

*Jc 

negative  value  of  V shows  that  an  untwist  occurs  during  extension. 

'I* 

When  a approaches  infinity,  both  U and  f approach  a limit  one 
and  V approaches  zero.  For  given  values  of  <r,  a and  £,  the 
quantities  U , V , u and  y are  linear  functions  of  m.  It  is  found 

that  U decreases  with  increasing  m.  When  m is  sufficiently 

# * 

large,  U becomes  negative.  The  value  of  m for  U =0  is  denoted 

by  m . It  is  found  that  m decreases  with  increasing  a as  shown 
c c 

In  Figure  2, 

4 . Extension  of  Elastic  Continuous  Filament  Yarns 

Let  ug  consider  an  idealized  infinitely  long  yam  of  circular  cross 
section  with  radius  R subject  to  an  axial  force  F and  a twisting 
moment  M at  both  ends  as  shown  in  Figure  3a.  The  yarn  is  composed 
of  a large  number  of  helical  fibers-  The  length  of  one  turn  of  twist  of 
all  fibers  is  regarded  as  constant  and  is  denoted  by  h = 2nk.  We  shall 
treat  each  fiber  element  as  a slender  curved  rod  of  helical  configuration. 
The  cross-sectional  area  of  the  yarn  element  cut  by  a plane  perpendicular 
to  the  yarn  axis  is  rdrdip.  Hence  the  cross-sectional  area  of  the  element 
perpendicular  to  its  own  axis  is  dA  ~ r cos  0drd<p,  where  9 is  the 
helical  angle  of  the  demerit.  The  principal  normal  curvature  and  the 
torsion  of  the  element  are  again  given  by  equation  (3.1).  The  components 
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of  the  stress  resultant  acting  on  the  cross  section  of  the  yarn  element 
are  denoted  by  dF, , dF  and  dF  and  the  components  of  the  moment 

\ p V 

acting  on  the  cross  section  are  denoted  by  dM, , dM  and  dM  . The 

A. p V 

components  of  the  distributed  force  and  the  distributed  moment  per  unit 
length  of  the  element  are  dp, , dp  , dp  , dm, , dm  and  dm  . Put 

\ P v'  k’  p V 

~ d 

( ) = ^ ( ).  Equations  of  equilibrium  for  a long  yarn  are  similar  to 
equations  (3.  2)-(3.  7).  They  are 

V " P\  = 0 ’ (4-!) 

F.k  - F r + p = 0 , (4.2) 

V + = 0 ’ (4-3) 

= 0 , (4.4) 

M,K-MT-F+m=0,  (4.5) 

X.  v v p 7 

M t \ F + m = 0 . (4.  6) 

p p V 

The  strain-displacement  relations  are  again  given  by  equations 
(2.  2 2)-(2.  27).  Based  on  the  same  reason  as  that  used  in  the  analysis 
of  the  two-ply  filament  yarns,  it  is  found  that  equation  (3.10)  is  also 
valid  in  the  case  of  continuous  filament  yarns. 

Let  us  denote  the  normal  compressive  stresses  in  the  p and  v 
directions  by  P and  Q,  the  normal  strain  components  in  the  u and  v 

directions  by  e and  e and  the  shearing  stress  and  shearing  strain 

M-  v 

corresponding  to  the  k,  v directions  by  T and  e,  . Here,  we  assume 

A.V 
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that  there  is  no  slipping  between  fibers.  Hence  T is  caused  by 

friction.  The  stresses  acting  on  an  infinitesimal  yarn  element  are  shown 

in  Figure  3b.  As  we  shall  see  later,  all  stress  components  not  shown 

in  Figure  3b  are  actually  zero.  From  Figure  3b,  it  is  found  that  p 

and  m are  related  to  P,  Q and  T by 
H- 

p = “ + “(P  - Q cos20)  , (4.7) 

P dr  r ’ 

m = T . (4.8) 

M- 

The  cross  section  of  the  yarn  element  is  infinitesimal.  It  can  be 
regarded  as  doubly  symmetrical.  The  fiber  is  considered  to  be  elastic  and 
transversely  isotropic.  We  have  the  following  constitutive  relations: 


ai 

e.  + aHc  + 

\ 2 p 

+ Er^TO  , 

(4.9) 

p = 

-( 

a e.  + a„e 

2 1 ip 

+ Vv*  ’ 

(4.10) 

Q = 

-(■ 

a„e.  + a ,e 
c \ 4 p 

+ a3e„)  , 

(4.11) 

/•v 

F = 

i 

acy  ) 

(4.12) 

M- 

2 

5 p 

F = 

V 

1 

2 

V,  ■ 

(4.13) 

T ^ 

a5 

- 

(4.14) 

where  a^(i  = 1,  2,  . . . , 5)  are  material  constants,  E = a^  - + a4) 

is  the  elastic  modulus  of  the  fiber  in  the  tangential  direction, 

2 

r = (I  + I - J)/A,  I and  I are  the  moments  of  inertia  of  the  cross 
d p v ' 7 p v 

section  and  J is  the  torsional  constant.  In  comparing  equation  (4.9) 

with  (2.  35),  it  is  found  that  an  additional  term  of  a _(e  + e ) is  added 

7 2 p v 
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to  include  the  effect  of  the  normal  strain  components  in  the  transverse 


direction.  The  constitutive  relations  for  M, , M and  M are 

\ p v 

1 2- 

M\  2 a5r\6  ’ 

- 2 

M - Er  K , 

p.  p p 

~ 2 

M = Er  K , 

V V V 7 


(4.15) 

(4.16) 

(4.17) 


where  r^,  r^  and  r^  are  radii  of  gyration  corresponding  to  the  moments 

of  inertia  J.  I and  I respectively, 
p v 

By  a similar  procedure  as  what  we  used  in  the  analysis  of  two-ply 


filament  yarns,  we  can  prove  that  F = M =p=m=0.  We  can 

p p \ \ 

also  show  that  equations  (3.26)  and  (3.27)  still  hold  in  this  problem. 
By  substitutions,  we  obtain 

^ 2 ;i;  ^ 

- Sj(u  cos  0 + r co  sin  0 cos  6 - sin  0)  + a,(e  + ?,  ) 


2 v ^ t.  / 

\X  V 


rd  * 3 v L 

+ E — (w  cos  0 - “ ^ sin  0 cos  0)  , 
P P 


(4.18) 


P - -a,.(u  cos^e  + rw  sin  0 cos  0 - sin  o)  --  a„e  - a e , (4.19) 

■g  p 3 p 4 v 


P 


0 = -a,(u  cos^G  + rco  sin  0 cos  0 - s.in  0)  - a ,e  - a c , (4. 20) 

£ p 4 p 3 v 


~ 1 * * 2 Uu 

F - ~ ar(u  sin  0 cos  9 - rco  cos  0 + cos  0 + o ) , 
v 2 5'  p p ’ 


(4.21) 


,,  i ^ ^ ^ jJL  > 

M.  = “a_i\(u>  cos  9 - ^ sin  6)  , 

A.  4 5 \ p 


(4.22) 


2 , 


<P 

_iF 


M = Er“(w  sin  0 cos  0 + cos  0)  . 
v v p 


(4.23) 
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From  equations  (4.  2),  (4.5),  (4.7),  (4.8),  (4.18),  (4.  22)  and  (4.  23), 
we  obtain 


- sin  e[a,(u ' cos20  + rco ' sin  0 cos  0 - sin  e)  + a (e  + e )] 

p i p & r-  v 

- “ cos  0 + ~~  +“  (P  - cp  cos^B)  = 0 . (4. 24) 

p dr  r 

In  the  derivation  of  equation  (4.  24),  we  consider  that  the  cross  section 

of  the  yarn  element  is  infinitesimal  and  neglect  those  terms  involving 

2 2 2 

r.,  r.  or  r . 

d’  \ v 

When  the  ends  of  the  yarn  are  clamped,  under  a pure  extension, 
the  helical  angle  of  fibers  decreases  and  separation  between  fibers  may 
appear  in  the  binormal  direction  in  the  region  r > r . However  in  the 
central  region  r < r,  this  separation  does  not  occur.  In  this  region,  it 
is  possible  for  us  to  find  e ,,  e _ and  in  terms  of  u , w and 


P 


u by  the  transformation  of  the  strain  tensor  from  a rectangular  Cartesian 
P 

coordinates  system  to  the  (\,  fi,  v)  system.  It  is  found  that  in  the  region 
without  separation  of  fibers, 

e = - — sin20  + u ' cos^ 0 + rw  ' sin  6 cos  0 , (4.  25) 

X.  r 


du 

_ _ 14 

'p  dr  ’ 

U 


£,  = 


u 2 * 2 * 

— ^ cos  0 + u sin  9 - r«  sin  0 cos  0 , 


(4. 26) 

(4. 27) 


u 

H 


_■  - sin  0 cos  0 4 u jlu  wm  v ■ * 
\v  r ^ 


* 2 2 

sin  0 cos  Q + w (sin  0 - cos  0)  . (4.  28) 
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Denote  the  ratio  of  the  cross-sectional  area  of  the  void  in  the  yarn 
to  the  total  cross-sectional  area  of  the  yarn  by  y.  Let  us  introduce 
the  following  dimensionless  quantities: 

i - r/R,  l - t/R,  C = k/R,  q(£)  = (c2  + £2)2,  ^ = r](l),  q ~ q(£),  a.  = a./E 
(i  = 1,2,  ...,5)  , 

f = F/[  2(1  - -y)ttER2]  , mt  = M/(FR),  u = u /(fR),  e = e^/f,  ^ = \/(Ef)  , 
if  = u7f,  V'"  = c «Vf,  P = P/(E£),  q - Q/(Ef),  t = T/(Ef),  ( )•  = ^().  (4.29) 

Note  that  U = 2E/E  , where  E is  the  overall  elastic  modulus  of 

y y 

2 

the  yarn.  It  is  found  that  a,  = 1 + Za  /{a  + a A. 

1 Z 3 4 

The  governing  differential  equations  for  the  deformation  of  the  yam 
in  the  region  without  separation  can  be  derived  from  equation  (4.14), 


(4.19),  (4.  20)  and  (4.  24)-(4.  27)  as 


+ fllU  + f12P  + gUU  + 912V  = ° ’ 


P'  + f21U  + £22P  + 921U  + g22V  = ° ’ 


(4.30) 


(4.31) 


wnere  f and  g (i  = 1,  2 and  j = 1, 2)  are  functions  of  a.,  c,  q 
i]  ij  ’ i’  ’ 1 

and  £.  After  u is  determined,  the  fiber  stress  f can  be  determined 
by  equations  (4.18),  (4.26)  and  (4.27)  and  the  contact  pressure  q can 
be  determined  by  equation  (4.  20),  (4.  6)  and  (4.  27). 

For  r > r,  separation  of  fibers  occurs  in  the  binormal  direction. 

In  this  region,  equation  (4.  27)  and  (4.  28)  are  no  longer  valid.  Flowever, 
we  may  set  Q = T = 0.  It  is  found  that  equations  (4.18),  (4.19),  (4.25) 
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and  (4,  26)  are  still  valid  but  equations  (4.  20).  (4.  7),  (4.  8)  and  (4.  24) 
must  be  replaced  by  the  following  equations: 

* 2 * u 

aJu  cos  0 + rw  sin  0 cos  0 - — ^ sin  0)  + a ,e  + a„e  = 0 , (4,  32) 

2 p 4 [x  3 v ’ 


dP 

13  p.  dr 

m = 0 . 


(4. 33) 

(4.34) 


1 # 2 Li  d P P 

~ sin  G[  a,(u  cos  9 + ru  sin  0 cos  0 - ~ sin  0)  + a Jt  + t ) 1 + T-  +~  = 0,  (4.  35) 

p 1 p 2'  [i  v 1 dr  r 

The  governing  differential  equations  can  be  obtained  from  equations  (4.19), 

(4.  32),  (4.26)  and  (4.  35).  These  equations  can  be  expressed  in  the 

same  form  as  equations  (4.30)  and  (4.31).  After  u is  determined,  f 

can  be  found  from  equations  (4.18),  (4.26)  and  (4.32). 

In  order  to  solve  for  the  differential  equations  (4.  30)  and  (4.  31), 
we  need  the  boundary  conditions  at  r = 0 and  r = R and  the  continuity 
conditions  at  r = r.  These  conditions  are 


u (0)  = P(R)  = 0 , 

r* 


V’r")=  V?+>  ’ 


(4.36) 

(4.37) 

P(r  ~)  = P(r  T)  . (4.  38) 

These  conditions  can  be  easily  expressed  in  terms  of  u and  u'.  The 
correct  choice  of  £ must  make  q(£  ) = 0.  This  can  be  done  by  a 
cut-and-try  process. 

•A. 

T* 

In  equations  (4.  30)  and  (4.  31),  two  unknown  constants  U and 
V are  involved.  These  constants  are  determined  by  the  overall 
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equilibrium  conditions  of  the  yarn.  From  Figure  3b,  it  is  found  that 
the  applied  axial  force  F and  twisting  moment  M can  be  expressed 
by  the  following  integrals: 

R 


F = 2ir(i  - y)  / [ (F  cos  0 + F sin  0)cos  9 


+ (T  cos  9 - Q sin  0)sin  0]rdr  , 


(4.  39) 


R 


M - Zrr (1  - y ) f [(M,  cos  0 + M sin  0 + F.  r sin  0 - F r cos  0)cos  0 

+ (Tr  sin  0 + Or  cos  0)sin  0]rdr  . (4.40) 

In  the  derivation  of  equations  (4.  39)  and  (4.40),  the  rule  of  mixtures  of 
y-diluted  single  fiber  property  has  been  employed.  By  substitutions, 
equations  (4.  39)  and  (4.  40)  can  be  written  as 

I 1 

/ t^udS  + J H2ud£  + cuu‘  + c12V‘  + Fxu(|)  + G1u(l)  = 1 , (4.41) 


i i * * _ 

J H3ud^  + J H4ud£  + c21U  + c22V  + F2u(|)  + G?u(l)  = mt  , (4.42) 

I 

where  th(i  - 1,  2,  3,  4)  are  functions  of  o\ , § and  q,  c (i  = 1,  2 

and  j = 1,  2),  F^  and  F2  are  constants  dependent  on  a , c,  £,  q 

and  q^  and  and  G2  are  constants  dependent  on  , c and  q , 

Numerical  solution  .is  first  sought  for  the  case  of  yarns  with  clamped 

ends.  In  this  case,  V =0.  In  our  calculation,  we  use  a.  = 1.16, 

a = 0.4,  a = 1.2  a.  - 0.8  and  a r = 0.3. 

2 ’ 3 ’ 4 5 
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In  Figure  4,  the  nondimensional  radial  displacement  and  the  fiber 
stress  are  plotted  against  the  radial  coordinate  for  different  values 
of  c as  indicated  on  the  curves.  The  positions  £ = £ are  shown 
by  dark  circles.  Note  that  in  the  region  without  separation  of  fibers, 
u(£)  is  approximately  proportional  to  £.  However,  this  proportionality 
does  not  appear  in  the  region  with  separation  of  fibers.  As  we  may  expect, 
the  maximum  Fiber  stress  occurs  at  the  center  of  the  yarn.  It  is  found 
that  when  c increases,  the  helical  angle  of  the  yarn  decreases  and 
hence  both  the  radial  displacement  and  the  maximum  fiber  stress  decreases. 
The  contact  pressure  p(£)  and  q(£)  and  the  shearing  stress  t(£) 
are  plotted  against  £ in  Figure  5.  At  the  center  of  the  yam,  p(0)  = q(0) 
and  t(0)  = 0.  In  the  region  of  the  separation  of  fibers,  q(£)  = t(£)  = 0. 

The  normalized  axial  strain  U and  the  twisting  moment  at  the 
fixed  end  m are  plotted  against  c in  Figure  6.  When  c approaches 
zero,  the  elongation  of  the  yarn  is  dominated  by  the  change  of  the  helical 
angle  and  hence  both  U and  m approach  infinity.  On  the  other  hand, 
when  c approaches  infinity,  all  fibers  become  straight.  Hence  U 
approaches  2 and  m^  approaches  zero. 

The  relative  modulus  (RM)  is  defined  as  the  ratio  E ^/E.  In 
Figure  7,  the  normalized  relative  modulus  with  respect  to  the  modulus  of 

a yarn  of  helical  angle  10°  is  plotted  against  the  helical  angle  of  the  yarn. 

2 

The  curve  based  on  a cos  a approximate  theory  [2]  and  the  curve 
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obtained  by  Hearle  et  al.  [ 3,4]  are  also  shown  for  comparison.  The 
experimental  values  are  found  to  be  scattered  .in  the  vicinity  of  the 
theoretical  curves  as  a result  of  sensitivity  of  the  relative  modulus  to 
material  properties  for  different  yarns. 

The  yarn  is  also  sensitive  to  the  twisting  moment  applied  at  its 
ends.  It  is  found  that  the  separated  fibers  tend  to  close  again  as  a 
result  of  the  application  of  a positive  twisting  moment  of  small  magnitude. 
Our  analysis  can  be  applied  to  investigate  the  behavior  of  the  yarn  after 
all  fibers  are  closed.  In  Figure  8,  the  value  of  U and  V for  a yarn 
with  c = 3 and  closed  fibers  are  plotted  against  m . From  those 
curves,  we  can  draw  the  same  conclusion  as  what  we  have  made  in  the 
problem  of  the  twTo-ply  filament  yarns,  i.e.  the  overall  axial  strain  of 
the  yarn  is  reduced  due  to  the  superposition  of  an  additional  twisting 
moment.  When  mt  is  sufficiently  large,  separation  of  fibers 
may  occur  in  the  principal  normal  direction  and  buckling  of  the  yarn  may 
be  introduced.  The  investigation  of  the  elastic  stability  of  yarns  is 
beyond  the  scope  of  this  paper. 
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Abstract 

A maximum  likelihood  decision  algorithm  is  described 
for  Markov  sequences  and  independent  observations.  At 
least  six  different  applications  of  this  algorithm  to  the 
field  of  digital  communications  have  been  implemented  or 
proposed.  The  three  most  important,  convolutional  coding, 
intersymbol  interference,  and  voice  compression,  are 
summarized  and  discussed. 
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1.0  The  Basic  Algorithm 

Decision  theory,  Bayesian  or  otherwise,  has 
classically  been  concerned  with  the  testing  of  hypotheses 
involving  disjoint  events,  which  are  either  unique  or  a 
sequence  of  successively  independent  events.  For  example, 
in  reliability  testing  the  hypotheses  concern  the  quality 
of  the  lot.  In  radar,  it  is  whether  a target  is  present 
or  absent,  and  if  more  than  one  target  is  to  be  tested, 
successive  events  are  taken  to  be  independent.  In  block 
coded  digital  communications  M hypotheses  regarding  the 
messages,  or  codewords  of  the  block  code,  are  to  be  tested, 
but  successive  codewords  are  assumed  to  be  independent 
of  one  another. 

On  the  other  hand,  there  are  numerous  interesting 
applications  of  decision  theory  for  which  the  hypotheses 
correspond  to  the  states  of  a Markov  chain.  Figure  1.1 
represents  a fairly  general  system  model  of  such  applications. 
For  a memoryless  channel  each  term  of  the  observation 
sequence  depends  only  on  the  corresponding  term  of  the 
transition  sequence.  That  is,  y(k)  depends  only  on  £ (k) 
and  not  on  previous  and  successive  transitions. 
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Markov  Source 


State  Sequence 
x (0) x (1) x (2) . . .x (k)  ... 

Transition  Sequence 
U1H(2)  . . .£(k)  . . . 


Memory less 
Channel 


Observation  Sequence 
y (l)y (2) . . .y (k)  ... 


x <->  | 


Figure  1.1  General  System  Model 


Initial  State  x(0j  = aQ 
Final  State  x(L)  = aQ 


Figure  1.2.  State  Diagram  {only  permissible  transitions  shown) 


The  Markov  source  generating  the  state  sequence 
and  hence  the  transition  sequence,  which  is  in  one-to-one 
correspondence  with  the  state  sequence,  can  be  depicted  by 
a Markov  state  diagram  as  shown  in  Figure  1.2.  Only  the 
permissible  transitions  are  shown;  all  others  are  assumed 
to  have  transition  probability  zero.  With  very  little 
loss  of  generality,  the  initial  and  final  states  of  the 
source  are  both  taken  to  be  state  aQ.  In  practice  even 
this  requirement  can  be  avoided.  Superficially  the  model 
of  Figure  1.1  is  similiar  to  the  estimation  theoretic 
model  upon  which  Kalman  filtering  is  based.  However,  the 
Markov  source  here  generates  a finite  state  sequence  while 
the  estimation  theoretic  case  involves  a real  sequence . 

Table  1.1  contains  a derivation  of  the  maximum 
likelihood  or,  more  precisely,  the  maximum  A Posteriori 
decision  rule.  The  key  expression  contained  in  the  box  is 
derived  on  the  basis  of  three  simple  observations. 

(1)  There  is  a one-to-one  correspondence  between 
the  state  sequence  and  the  transition  sequence.  Hence, 
the  vector  x can  be  replaced  by  £ at  any  point; 

(2)  the  channel  is  memoryless  and  hence  the 
conditional  probability  of  y(k)  given  £ (k)  can  be  factored 
out  from  the  expression  of  the  joint  conditional  probability 
from  1 to  K; 
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(3)  the  source  is  Markov  and  hence  the  Kth  state 

depends  only  on  the  (K-l)th. 

We  then  seek  to  maximize  the  expression  within  the 

box  to  determine  the  maximum  a posteriori  probability 

sequence  terminating  at  state  a^  at  time  K.  With  the 

definition  of  and  X^,  it  then  follows  easily  that  the 

maximum  at  time  K to  state  a^  must  have  been  a maximum  to 

some  state  a^,  i e (1,  2,  . J) , at  time  K-l.  This  gives 

rise  to  the  recursive  algorithm  shown  at  the  bottom  of 

Table  1.1,  where  A.  (a.)  is  called  the  path  metric,  being 

K J 

proportional  to  the  logarithm  of  the  maximum  a posteriori 

probability  for  paths  terminating  in  state  a^  at  time  K, 

and  it  corresponds  to  a path  which  we  call  tt,  (a.)  . That 

k j 

path  must  have  as  its  last  term  the  symbol  a^  and  all  previous 
terms  depend  on  the  history  of  the  path;  that  is,  the  states 
at  which  it  resided  at  previous  times.  Initial  and  final 
conditions  are  derived  from  the  fact  that  we  have  assumed 


state  art  to  be  both  the  initial  and  final  state.  Hence 

the  logarithm  of  the  probability  of  aQ  at  0 is  0,  and  for 

any  other  state  at  time  0,  it  is  -00.  Similarly,  the  final 

condition  reflects  the  fact  that  with  probability  1 the 

final  transition  must  be  to  state  a . 

o 
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Table  1.1  Maximum  A Posteriori  Decision  Rule 


1-1 

State  Sequence  x < > Transition  Sequence  £ Observation  Sequence  y:  £(k)  ■*  yOO 


x (k)  e {aQ,  a2  — aj)  x(0)  = aQ  x(L)  = aQ 

P[x(l) .. .x(K) Jy {!) . . .y{K)]  = Ptx(l) ...x(K) , y (1) . . .y (K) ]/P[y (1) . . .y  (K) ] 


P[y(l)...y(K),  x{l)...x(K)]  = P[y(l) ...y(K) , £(!)...  C (K) ] - P [y (1) . . .y (K) \ i (1) . . . 5 (K) ]P [x{l) . . .x(K) ) 
= P[y {!)... y(K-l)  U(l) ...C(K-l)]P[y(K) |e(K)lPIx(l) . . .x(K-l) ]P [x{K) 1 x (K-l) ] 


log  Pty(l) ...y(K) , x ( 1 ) . . .x (K) ] = log  P[y(l) ...y(K-l) , x(l) ...x (K-l)]  + log{P[y(K) |£(K) ]P[x(R) |x(K-l)]J 


Define  A„{a.)  = Max  log  P[y (1) . . ,y (K) , x(l)...x(K)]  = log  P [y (1) . . iy (K) , x ( 1) . . . x (K-l) a . ] 

all  paths 
terminating 
at  state  a, 
at  time  K ^ 

(a^-*a  j ) = log  {P  ]y  (K)  | £(K) :a^  -*■  a^]P  [x(K)  =a^|x(K-l)  = ai]  } 

Recursive  Algorithm; 

Aw(a.)  = Max  [Av  ■+  a.)]  Path  Metric  for  state  a.  at  .*me  K 

J iell, 2, . . . JJ  J a A 

for  path  tt  (a.)  = x (l)x  (2)  . . .x  (K-l)  a . 

K 3 3 

Initial  Conditions: 

Ao(aQ)  = 0;  AQ(ai)  = -»  Vi  / 0 


Final  Condition: 


A pictorial  view  of  the  algorithm,  which  also 
represents  its  mechanization,  is  shown  in  Figure  1.3. 

Both  the  state  metric  and  state  path  memory  are  stored  in 
registers  which  are  continually  updated.  Thus,  at  time 
K+l,  for  example,  the  path  metric  for  state  a2  is  chosen  as 
the  maximum  of  the  path  metric  at  time  K for  state  a^ 
incremented  by  the  branch  metric  for  the  transition  from 
a^  to  and  of  the  state  metric  at  time  K for  state 
a 2 incremented  by  the  branch  metric  for  the  transition 
from  a2  to  itself.  Depending  on  which  of  these  two  terms, 
which  we  call  and  r2,  is  greater,  the  corresponding 
state  path  memory  is  chosen  and  incremented  by  a2,  which 
corresponds  to  the  state  that  we  are  looking  at,  at  time 
K+l.  Thus  the  registers  for  the  state  path  memory  are 
shown  of  indefinite  length  and  they  grow  linearly  with 
time.  Even  this  requirement  can  be  avoided  in  practical 
systems . 

We  now  proceed  to  describe  three  major  applications 
of  the  algorithm. 


257 


2.0  First  Application:  Convolutional  Codes 

The  most  widespread  and  significant  application  of 
the  algorithm  just  described  is  for  convolutional  codes 
which  improve  performance  on  a noisy  channel.  Superficially 
a convolutional  code  generator  is  very  similar  to  a block 
code  generator,  as  shown  in  Figure  2.1.  The  only  difference 
is  that  whereas  a block  code  generator  takes  K successive 
bits  and  generates  N channel  symbols,  a convolutional 
code  generator  enters  one  bit  at  a time  into  the  encoder 
register  and  generates  n bits  where  n is  on  the  order  of 
~ in  the  block  code  case.  Actually  the  diagram  in  Figure  2.1 
can  be  generalized  to  the  case  where  k > 1 bits  are  entered 
at  one  time,  but  k is  much  less  than  K in  most  cases  of 
interest. 

The  convolutional  coder  can  be  regarded  as  a linear 
finite  state  machine  whose  states  constitute  a Markov 
sequence.  In  fact,  the  Markov  graph  of  Figure  1.2  applies 
to  the  convolutional  code  of  constraint  length  K = 3,  and 
for  all  convolutional  codes  of  rate  1/n  the  state  or  Markov 
diagram  is  connected  in  a binary  fashion;  that  is  from 
each  state  there  are  branches  going  to  two  states  and  branches 
coming  to  it  from  two  states.  As  shown  at  the  bottom 
of  Figure  2.1,  the  state  xO  has  branches  going  to  lx 
and  Ox  where  x is  an  arbitrary  K-2  dimensional  vector; 

•N, 

similarly  state  xl  has  branches  going  out  to  the  same  two 
states. 
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Figure  2.1  First  Application:  Convolutional  Codes 


ro 

CTi 

o 


Block  Code:  Fill  Register  with  K bits  Convolutional  Code,  Contraint  Length  K, 

Generate  N Outputs  Input  1 bit  at  a time 

Rate  R = K/N  < 1 Generate  n outputs 

R = 1/n 


N Outputs  n outputs 


{May  generalize  to  k < K inputs  for  each 
n outputs:  R = k/n) 

MAP  Decision  Rule:  MAP  Decision  Rule: 


Hypotheses  <— > Disjoint  Events 

Classical  Solution  involves  comparison  of 
K 

2 ' likelihood  functions  or  A Posteriori 
probabilities 


Hypotheses  <— > Markov  Sequences 
K —1 

2 States  correspond  to  Register  Contents 
Permissible  State  Transitions: 

xG 
xl 


e.g*  For  K'  3,  state  diagram  of  Figure  1.2 


Table  2.1  represents  the  key  asymptotic  results  in 
comparing  block  and  convolutional  coding.  For  all  but  a 
very  small  rate  region/  the  lower  bound  exponent  E is 
approximately  equal  to  the  upper  bound  exponent  E.  The 
most  important  difference  in  performance  between  block  and 
convolutional  codes  is  that  whereas  the  block  code 
exponent  E^  is  a positive  convex  function  for  all  rates 
less  than  capacity,  the  convolutional  code  exponent  E (R) 
is  concave  for  most  channels,  and  certainly  for  all 
channels  of  interest. 

Figure  2.2,  in  fact,  shows  these  exponents  for  a 
typical  channel  (e.g.,  the  additive  Gaussian  channel  or 
the  binary  symmetric  channel) . The  complexity  of  the 
decoder  T,  as  already  observed  and  indicated  in  Figure  1.3, 
grows  as  2 since  this  is  the  storage  required  as  well  as 
the  number  of  computations  and  comparisons  per  bit. 

Similar  observations  can  be  made  for  block  codes.  Thus 
the  error  probability  decreases  as  T ' 1 ' . For  this 

reason  convolutional  codes  outshine  block  codes  of  the 
same  complexity. 

As  a practical  example,  for  the  convolutional  code 
of  constraint  length  K = 7 and  rate  1/2,  the  required 

channel  signal-to-noise  for  a bit  error  probability  of 

-5 

10  is  reduced  by  5 dB  compared  to  that  required  without 
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Performance  Comparison 

Block  Coding  Theorem  (Shannon  et  al) 


Table  2.1 


For  any  memoryless  channel  and  any  K,  3a  block  code  of  length  K,  rate  R,  and  2K  codewords 
for  which  the  bit  error  probability 


-(K/R)  E,  (R) 

PB  < 2 

where  Efa(R)  is  a positive  convex  function  V R < C,  channel  capacity.  Conversely,  for 
every  code  with  these  parameters 

-{K/R)  [E,  (R)  + o{K)  ] 

PB  > 2 

ro  rr) 

[For  most  channels  and  most  rates  of  interest,  E^tR)  2 E^fR)] 

Convolutional  Coding  Theorem 

For  any  memoryless  channel  and  any  K,  3 a convolutional  code  of  constraint  length  Kf  rate  R, 
and  2 states  for  which  the  bit  error  probability 

- (K/R)  [Ec(R)-o(K)  ] 

PB  < 2 where  EC(R)  is  a positive  function  which  dominates  Eb(R)  V R < C 

For  most  channels  of  interest  (e.g.  symmetric  channels)  E^ (R)  is  concave.  Conversely, 
for  every  code  with  these  parameters 

-(K/R) [E- (R)  + o(K) ] 

V 2 

IFor  most  channels  and  most  rates  of  interest,  eL(R)  = E_(R)] 


For  Typical  Channel 

E(R) 


ro 

CTl 

CO 


Practical  Results 

For  K = 7,  Rate  1/2  convolutional  code  on  coherent  Gaussian  channel,  required  channel 
signal-to-noise  ratio  reduced  by  5 dB  for  Pg  = 10~5  relative  to  uncoded  operation. 
Equal  complexity  block  codes  gain  only  2 to  3 dB  over  uncoded  operation . 


Figure  2.2 


coding;  for  block  codes  of  the  same  complexity,  the 
reduction  is  only  oh  the  order  of  2 to  3 dB.  Actual 
hardware  with  the  above  parameters  and  performance,  which 
is  presently  operational  is  shown  in  Figure  2.3.  This 
was  developed  by  LINKABIT  Corporation  for  the  U.S.  Army 
Satellite  Communication  Agency  at  Ft.  Monmouth.  Of  major 
significance  is  the  fact  that  this  equipment  is  capable 
of  decoding  at  information  rates  up  to  10  megabits  per 
second  which  are  necessary  for  high  capacity  and  multiple 
access  satellite  communications.  Figure  2.4  shows  the 
same  decoder  in  the  upper  right  hand  corner,  with  the  PSK 
modem  with  which  it  was  initially  tested.  The  size  of 
the  coder-decoder  is  approximately  one-quarter  of  that  of 
the  modem  and  yet  it  gains  a 5 dB  advantage  over  the  modem 


alone. 


Figure 


3.0  Second  Application:  Channels  with  Intersymbol 

Interference 

The  second  most  significant  application  of  the 
maximum  likelihood  decision  algorithm  of  Section  1 is  to 
the  improvement  of  performance  on  channels  which  exhibit 
intersymbol  interference.  Such  interference  can  be  caused 
by  either  of  two  physical  phenomena  illustrated  by  Figure  3.1: 

(a)  a bandlimited  channel  where  successive  symbols 
are  rendered  dependent  by  the  linear  filtering  characteristics 
of  the  channel.  The  coefficients  decay  more  slowly  as 

the  bandwidth  of  the  channel  decreases.  An  example  of  such 
a channel  results  from  transmission  of  digital  data  over 
bandlimited  telephone  lines. 

(b)  multipath  channel,  as  occurs  in  HF  ionospheric, 
or  tropospheric  propagation.  Whereas  in  case  (a)  the 
coefficients  are  usually  constant  over  a message 
transmission  or  reasonable  portion  thereof,  in  case  (b) 
the  coefficients  are  randomly  time  varying,  although  the 
rate  of  change  is  generally  low  compared  to  the  data  rate. 

In  either  case,  the  coefficients  must  be  estimated 
either  initially  or  continually  in  order  to  establish  the 
model.  Rapidly  converging  estimation  algorithms  for  this 
purpose  are  well  known  and  have  been  thoroughly  covered 
in  the  literature.  Once  the  model  has  been  established. 
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Figure  3.1  Second  Application: 
Discrete  Baseband  Model 


Channels  with  Intersymbol  Interference 

a)  Bandlimited 

b)  Multipath 


Transmitted 


a)  For  Bandlimited  Channel  {ctj^  are  fixed 

b)  For  Multipath  Channel  randomly  time  varying.  Estimation  of  parameters  from 

observations  required  - various  algorithms  with  reasonably  rapid  convergence 


Markov  Model  valid  in  either  case 


it  is  clear  that  the  shift  register  generates  a Markov 
sequence.  The  difference  between  this  and  a convolutional 
encoder  is  that  the  linear  operations  are  over  the  real 
number  field  rather  than  over  GF(2).  However,  for  a binary 
information  sequence,  u(l),  u (2 ),...,  u(K) , in  the  register 
the  state  of  the  input  is  the  same  as  the  state  of  the 
corresponding  input  to  a binary  convolutional  coder. 

Hence,  the  same  algorithm  applies  to  trying  to  determine 
what  sequence  was  transmitted,  given  the  received  or 
observed  sequence  y (k) , which  exhibits  both  the  effect  of 
the  intersymbol  interference  and  additive  noise. 

Table  3.1  illustrates  the  key  theorem  regarding  the 
performance  of  the  algorithm  for  intersymbol  interference 
channels.  PQ  is  the  error  probability  for  the  simple 
memory  less  additive  white  Gaussian  channel,  while  Pj. 
is  the  probability  with  intersymbol  interference.  The 
coefficients  K and  K,,  are  relatively  small  constants, 
independent  of  the  channel  energy-to-noise  e/NQ,  and 
generally  within  an  order  of  magnitude  of  each  other. 

For  y = 1,  the  performance  would  be  almost  the  same  as 
without  intersymbol  interference.  In  all  cases,  of 
course,  y £ 1,  but  the  key  result  is  that  for  K = 1 
(i.e.  the  two  tap  case) , y = 1.  Hence,  there  is  little  or 
no  degradation  when  the  multipath  channel  consists  of  two 
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Table  3.1  Performance  of  Algorithm  for  Intersymbol  Interference  Channels 


Theorem  (Forney):  The  Maximum  A Posteriori  Decision  Algorithm  applied  to  a two-level 

sequence  received  over  channels  with  intersymbol  interference  yields  a symbol  error 
probability 


Vo(Ye/V  < PI  < Vo'^V 

where  K.  and  K are  independent  of  the  channel  signal-to-noise  ratio  (e/N-) , (and  are 

U O 

generally  jwithin  an  order  of  magnitude  of  each  other) 

Pq(e/No)  is  the  error  probability  without  intersymbol  interference  and  y 1. 

For  K = 1 (2  tap  case) , y = 1 
(Generalizes  to  m-level  input  sequence) 


Practical  Kesult:  Transmission  over  multipath  channel  with  only  2 strong  paths  can  be 

received  with  very  little  degradation  relative  to  single  path  case. 
(Applies  also  to  data  modems  employing  "duo  binary"  or  "partial  response" 
coding) . 


Strong  paths  and  all  other  paths  are  negligible.  For 
telephone  channels  such  a case  is  often  artificially 
generated  by  applying  so-called  duo-binary  or  partial 
response  coding,  which  creates  a model  consisting  of  only 
two  strong  taps. 

These  results  can  be  generalized  to  an  m-level 
input  sequence.  While  no  operational  hardware  exists 
which  applies  these  techniques,  considerable  experimental 
work  and  simulations  have  been  performed. 
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4 . 0 Third  Application:  Digital  Voice  Compression 

Potentially  the  most  important  application  for  the 
future  is  to  the  compression  of  digital  voice.  In 
Figure  4.1  the  upper  diagram  shows  the  well  accepted  voice 
synthesizer  model  used  in  conjunction  with  linear  pre- 
dictive coding.  This  is  basically  a modern  digital  version 
of  the  model  for  the  channel  vocoder.  The  vocal  tract 
is  modelled  by  a digital  linear  filter  driven  either  by 
an  i.i.d.  sequence  of  Gaussian  variables  or  by  a periodic 
sequence,  the  latter  representing  voiced  sounds  and  the 
former  unvoiced.  Clearly  then  the  output  is  a Markov 
sequence,  although  not  a finite-state  one. 

The  lower  half  of  the  diagram  is  representative  of 
a large  class  of  standard  practical  voice  compression 
techniques,  including  delta  modulation,  differential  PCM, 
and  adaptive  predictive  coding.  In  the  last,  as  in  linear 
predictive  coding,  the  coefficients  a through  a.  are 
estimated  from  short  segments  of  the  actual  voice.  However, 
all-  these  techniques  attempt  to  represent  the  voice  by 
means  of  the  so-called  binary  residual  sequence  consisting 
of  + l's.  All  the  conventional  techniques  make  decisions 
to  generate  this  sequence  one  term  at  a time,  seeking  to 
minimize  the  mean  square  error  for  that  term  only.  An 
improved  technique  for  which  simulations  and  digital  voice 
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Figure  4.1  Third  Application:  Digital  Voice  Compression 

Voice  Synthesizer  Model  {Linear  Predictive  Coding)  - {ci^}  must  estimated  frequently 
(every  10  to  20  millisec) 


Standard  Technique  for  Delta  Modulation;  Differential  PCM;  Adaptive  Predictive  Coding 


Mean  Square  Error 
to  be  minimized 


Select  binary  sequence  one  term  at  a time  to  minimize  MSE  for  that  term  only 

Improved  Technique:  Select  best  path  through  Markov  state  diagram  generated  by  linear 

“ * filter  driven  by  binary  sequence  to  minimize  MSE  for  entire  path 


Performance  and  Practical  Results:  Subjective 


compression  experiments  have  been  performed  at  LINKABIT 
Corporation  involves  estimating  the  entire  sequence  using 
the  algorithm  of  Section  1..  One  may  regard  the  digitized 
uncompressed  voice  as  the  observation  and  one  then  looks 
for  the  binary  sequence  (i.e.,  the  path  through  the  Markov 
chain)  which  best  replicates  the  observation,  given,  of 
course,  the  model  and  its  coefficients.  The  compressed 
digital  voice  consists  of  the  residual  sequence  plus  the 
digitized  estimated  coefficients  {a^}.  The  voice  decoder 
at  the  receiver  reconstitutes  the  voice  sequence  by  passing 
the  residuals  through  the  digital  linear  filter  of  Figure  4.1 
with  these  tap  coefficients. 

Evaluation  of  this  technique  and  any  practical 
application  thereof  must  be  based  on  subjective  results. 
During  the  lecture  a tape  was  played  giving  the  relative 
performance  of  the  algorithm  based  on  estimating  the 
Markov  sequence  as  compared  to  the  results  of  a more 
classical  adaptive  predictive  coding  technique  where  each 
terra  of  the  residual  was  determined  individually.  In 
addition,  the  result  of  adding  errors  caused  by  channel 

noise  was  shown  to  affect  the  algorithm  very  little  as 

-2 

long  as  the  channel  error  rate  was  below  10 
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5.0  Other  Applications 

Another  area  of  application  described  in  the 
literature  has  been  optical  character  recognition,  where 
English  text  was  modelled  by  a first-order  Markov  source, 
and  the  memoryless  channel  was  represented  by  the  optical 
character  recognition  device.  The  algorithm  has  also  been 
applied  to  the  demodulation  of  minimum  shift  keyed  (MSK) 
modulation  and  to  certain  phase  tracking  problems.  Any 
application  which  can  be  modelled  as  shown  in  Figure  1.1 
is  suited  to  the  maximum  likelihood  decision  algorithm 
described  here.  The  breadth  of  its  applicability  is 
remarkably  wide,  apparently  spanning  the  gamut  of  modern 
digital  communications,  which  includes  modems,  channel 
coding  and  data  compression. 
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ABSTRACT 

Since  the  shear  stresses  at  the  wall  are  of  primary 
interest  in  gas  dynamic  erosion,  an  analytical  technique 
is  formulated  for  unsteady  shear  forces  by  consideration 
of  the  continuity  and  momentum  equations  of  unsteady  com- 
pressible boundary  layers.  The  continuity  and  momentum 
equations  are  transformed  into  one  governing  equation  by 
the  use  of  equation  of  state,  power  law  velocity  and 
temperature  profiles,  and  generalized  Blasius'  law  of 
friction.  The  single  partial  differential  equation  is  in- 
tegrated across  the  boundary  layer  to  reduce  by  one  the 
number  of  independent  variables  (three).  The  resulting 
hyperbol i c- type  partial  differential  equation  is  transformed 
into  two  ordinary  differential  equations  by  the  character- 
istics method.  These  equations  are  integrated  to  predict 
the  results  in  a 30mm  weapon  system.  The  results  include 
not  only  various  boundary  layer  thickness  parameters  but 
also  the  shear  stress  at  the  wall  or  skin  friction  coefficient 
which  is  large  near  the  breech  and  decreases  in  the  flow 
direction  except  near  the  projectile.  The  skin  friction 
coefficient  decreases  with  increase  in  time.  The  shear 
stress  at  the  wall  is  of  the  order  of  100  psi. 
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INTRODUCTION 


The  shear  forces  are  not  only  important  in  the  cal- 
culation of  drag  but  also  in  the  formulation  of  force  balance 
in  erosion  models.  The  shear  stresses  also  play  an  important 
role  in  predicting  the  heat  transfer  provided  that  a valid 
analogy  between  skin  friction  and  heat  transfer  is  available. 
The  shear  stresses  acting  on  the  bore  surface  are  essential 
to  formulate  and  predict  the  gas  dynamic  erosion  in  gun 
barrels.  There  are  now  available  numerous  empirical  or 
semi empi ri cal  methods  which  provide  reliable  estimates  of 
turbulent  skin  friction  for  zero  pressure  gradient  compressible 
and  incompressible  turbulent  boundary  layer  flows.  However, 
this  is  not  the  case  for  unsteady  boundary  layers.  The  state- 
of-the-art  in  unsteady  boundary  layers  and  turbulence  models 
is  quite  limited.  Therefore,  any  development  of  analytical 
techniques  with  reasonable  assumptions  is  of  considerable 
importance  to  engineers  and  designers.  A symposium[l ] was 
held  on  unsteady  boundary  layers  at  Laval  University.  There 
are  also  several  papers,  scattered  in  the  literature,  con- 
centrating on  numerical  solutions[2]  of  unsteady  boundary 
layer  equations.  The  similarity  methods  do  not  play  an 
important  role  due  to  severe  restrictions  on  the  generality 
of  the  flow  field.  Apparently,  there  is  no  working  tool 
yet  available  to  analyze  especially  the  case  of  unsteady 
compressible  flow  with  arbitrary  pressure  gradients  and  free 
stream  conditions  outside  the  boundary  layer. 

Yalamanchili  presented  the  Rayl ei gh-Bla s i us  incompressible 
flow[3],  shock-induced  boundary  layers[4],  and  projectile- 
induced  turbulent  boundary  layers[5]  in  AIAA,  TICOM,  and 
ASME  meetings,  respectively.  Even  though  the  continuity, 
momentum,  and  energy  equations  are  considered  in  the  analysis 
of  unsteady  turbulent  boundary  layers[5],  the  density  variation 
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across  the  boundary  layer  is  not  considered  due  to  tedious 
derivations.  However,  another  treatment  of  unsteady  boundary 
layers  Is  considered.  The  transverse  velocity  component 
can  be  expressed  in  terms  of  longitudinal  velocity  component 
and  density  by  the  use  of  the  continuity  equation.  The 
density  is  expressed  in  terms  of  pressure  and  temperature  by 
the  equation  of  state.  Of  course,  the  pressure  is  a given 
quantity  in  any  boundary  layer  where  there  is  no  interaction 
between  the  viscous  and  inviscid  flows.  The  temperature  is 
written  as  a function  of  the  velocity  boundary  layer  thickness 
parameter  and  a nearly  constant  coefficient.  Thus,  the 
energy  equation  is  not  necessary  to  close  the  system  of 
equations.  It  is  to  be  noted  that  the  temperature  is  not 
of  prime  interest  here  even  though  it  can  be  approximated 
from  the  assumed  functional  relationship.  Nikuradse[6]  ex- 
perimental data  suggests  power  law  velocity  profiles  for  the 
longitudinal  velocity  component.  The  single  partial  differ- 
ential equation  Is  Integrated  across  the  boundary  layer  to 
reduce  by  one  the  number  of  Independent  variables.  The 
generalized  Blaslus'  law  of  friction,  relating  the  shear 
stress  at  the  wall  and  the  longitudinal  velocity  distribution, 
is  utilized.  The  resulting  hyperbolic  type  partial  differ- 
ential equation  is  transformed  into  two  ordinary  differential 
equations  by  the  characteristics  method.  These  equations 
are  integrated  to  predict  the  results  In  a 30mm  weapon  system. 
The  results  Include  not  only  various  boundary  layer  thickness 
parameters  but  also  the  shear  stress  at  the  wall  or  skin 
friction  coefficient. 
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II  GOVERNING  EQUATIONS 


It  is  convenient  to  use  integral  boundary  layer  equations 
to  apply  the  method  of  weighted  residuals  (MWR).  The  follow- 
ing equation  can  be  derived  either  by  consideration  of  control 
volume  and  setting  continuity  (mass)  and  momentum  balance 
for  the  same  or  by  application  of  Prandtl's  boundary  layer 
assumptions  to  the  Navier-Stokes  equations  and  integrating 
the  resulting  equations  across  the  boundary  layer: 


Momemtum  Equation: 


- iE. 
9x 


dy  - tw  = 


at 


pudy  + w 


(1) 


Where  u = velocity  component  in  x-direction,  ux  = velocity 
outside  the  boundary  layer,  p = pressure,  p = density, 
t = time  and  xw  = shear  stress  at  the  wall. 

Utilizing  the  following  inviscid  momentum  equation 
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9x 
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and  the  following  identities 
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the  following  single  governing  partial  differential  equation 
is  obtained. 

Tw  = j (Plurpu)  dy  + — pu(u1-u)dy  + j (p^-pujdy 


(pi-p)  dy 


(4) 
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Nikuradse[6]  measurements  of  velocity  distribution  in 
pipes  suggested  the  following  form: 


u^ 

ui 


(5) 


Where  6 i s the  velocity  boundary  layer  thickness,  function  of 
independent  variables  x and  t,  and  n is  a parameter  depending 
upon  the  Reynolds  number  based  upon  mean  velocity  and  diameter 
of  the  tube: 


n 

6 

7 

8 

8.8 

10 

Reynolds 

number 

4xl03 

2 . 3x1 0 4 

1.1x10s 

l.lxlO6 

| 

2 to  3.2 
millions 

The  equation  of  state  and  the  fact  that  the  pressure  is 
constant  across  the  boundary  layer  yields  the  following  re- 
lationship for  density: 


£ = L.  (6) 

Pi  T 


Where  T is  the  temperature  and  the  subscript  i indicates  the 
quantity  outside  of  the  boundary  layer.  If  another  power 
law  profile  is  assumed  for  the  dimensionless  temperature 
(i.e.,  equation  5);  and  setting  Tw  equals  zero  without  any 
loss  of  generality,  the  density  ratio  (equation  6)  may  be 
rewritten  as  equation  (8). 


T-Tw  _ y> 

T -T  'c' 

i i iw  13 


(7) 


p_  = c (^f 1/m 

Pi  L V 


c 1/m 

where  C = (-j-) 


(8) 
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The  variable  c represents  the  temperature  boundary  layer 
thickness  and  it  is  also  greater  than  the  velocity  boundary 
layer  thickness  6 for  the  conditions  in  a gun  barrel.  Since 
the  dimensionless  ratio  of  boundary  layer  thicknesses  (c/6) 

Is  fairly  constant  with  a typical  value  of  2.5  and  the  expo- 
nent m is  large  similar  to  n,  it  Is  a good  approximation  to 
treat  C as  independent  of  x and  t. 

Prandtl  discovered  the  relationship  between  Blaslus' 
law  of  friction  and  the  velocity  distribution  due  to 
Nlkuradse  experimental  data.  If  this  relationship  Is  gener- 
alized to  include  arbitrary  n,  the  following  result  Is 
obtained  for  shear  stress  at  the  wall, 

Bj"!  (9) 

Where  Cf  Is  the  skin  friction  coefficient  and  Cn  Is  given 
In  the  form  of  a table: 


n 

7 

8 

9 

10 

Cn 

mom 

,0174 

,0143 

.0117 

The  use  of  equations  (5),  (8),  and  (9);  the  evaluation  of  In- 
tegrals in  equation  (4),  and  algebraic  manipulations  will 
yield  the  following  hyperbolic  type  of  partial  differential 
equation  with  only  two  independent  variables,  x and  t. 

P "j|^  + 'H'  a R (6,x,t)  (10) 

where  R 55  A Cf  - An  p^  - An  Uj  * D ^ An  Uj-E  ^ An  (pjuJ)  (11 ) 
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The  coefficients  P,Q,A,B,c,D,  and  E are  given  in  the  follow- 
ing table  for  compressible  flow  and  the  case  of  constant 
density  across  the  boundary  layer. 


Coefficient 

Compressible  flow 
p(x,y,t) 

- — — — 

Constant  density  across 
the  boundary  layer,  p(x,t) 

P 

* . ..  ........  — . 

n 

2m-n+mn 

n 

n+? 

Q 

1 

m=T 

1 

A 

. . 

m-n+mn 

i2- 

n+1 

2 

B 

A- « i 

m-T 

1 6 
n+T  6 

c 

1 

! 

m-n+mn(l-C) 

Cm  ^ 

1 * 
n+T  5 

D 

m-n+mn(l-C)  „ 
Cm2  6 

1 A 

n+1 

E 

n 

2m-n+mn 

(n+1  )'(n+2)  6 

It  is  clear  from  this  table  that  the  case  of  constant  density 
across  the  boundary  layer  is  not  easy  to  derive  from  the  general 
case-of  compressible  flow.  The  following  table  is  given  to 
show  the  effect  of  variable  density  on  coefficients. 
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The  Constant  in 
Coefficient 

m = 6 

n = 8 

Compressible  flow 

Constant  density  across 
the  viscous  layer 

| P 

.15 

0.80 

Q 

.20 

1.00 

A 

.64 

4.50 

B 

.20 

0.11 

c 

-.06 

0.11 

D 

-.06 

o.n 

E 

• 

i .15  I 

1 | 

1 1 

0.09 

Some  coefficients  decrease  whereas  others  increase  and  the 
changes  are  not  proportional.  Therefore,  it  is  difficult 
to  assess  the  overall  effect  of  variable  density  across  the 
boundary  layer. 

The  simplified  governing  equation  (10)  can  be  reduced 
to  two  ordinary  differential  equations  by  a procedure  some- 
times referred  to  as  the  method  of  characteristics.  The 
perfect  or  total  differential  can  be  written  as 

d6  - ||  dx  + ||  dt  (12) 

The  substitution  of  the  partial  derivative  8 6/ 8 1 from 
equation  (10)  into  equation  (12)  yields  the  following: 

d6  = (dx  - dt)  ||  + Hi!  dt  (13) 

Q 3x  Q 

If  the  quantity  inside  the  paranthesis  can  be  set  to 
zero  and  these  resulting  curves  considered  as  velocity 
characteristics,  equation  (13)  can  be  reduced  to 
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dS  UiR  ris>  > \ 
dt  ' ~T‘ 


(14) 


along  the  velocity  characteristic 


dx  _ uiP 
dt  Q 


(15) 


Note  that  the  same  equations  (14)  and  (15)  can  also  be 
derived  by  following  the  classical  definition  of  characteris 
tics.  The  characteristics  are  defined  as  the  curves  along 
which  the  derivatives  of  the  fluid  properties,  such  as  96/9x 
and  96/9t,  are  indeterminate.  By  consideration  of  these  two 
partial  derivatives  as  unknowns  in  equations  (10)  and  (12), 
one  can  determine  either  one  of  these  two  unknowns  by  Cramer 
rule  and  set  its  numerator  and  denominator  to  zero  in  order 
to  obtain  the  equations  (14)  and  (15),  respectively. 


Ill  BOUNDARY  LAYER  PARAMETERS 


The  effects  of  the  viscous  shear  layer  are  not  only 
induction  of  shear  stresses  on  the  wall  but  also  involve 
various  effects  on  the  gas  flow.  These  effects  can  be  repre 
sented  by  means  of  various  boundary  layer  parameters.  For 
example,  the  displacement  thickness  ( 6 d ) is  defined  as 


6d 


(1- 


pu 

PlUi 


0 dy 


(16) 


which  represents  physically  the  distance  by  which  the 
external  inviscid  flow  is  shifted  owing  to  the  formation  of 
the  boundary  layer.  The  momentum  thickness  is  defined  as 


^m  ' 


£.  JL  O-JL  ) (|y 
P,  u.  v u.  ' y 


(17) 
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This  parameter  is  useful  in  the  determination  of  laminar- 
turbulent  transition  and  also  indicates  a measure  of  loss 
of  momentum  in  the  boundary  layer.  The  velocity  thickness 
is  defined  as 


6V 


(18) 


The  energy  dissipation  thickness  is  defined  as 
6 


5ed 


£_  JL  ( i _ _!L  ) 2 dy 
Pi  V Uj  ' y 


(19) 


This  parameter  indicates  a loss  of  mechanical  energy  occurr- 
ing in  the  boundary  layer.  The  enthalpy  thickness  is  defined 
as 


6 


e = 


6 

* 

J 

0 


_p_ 

Pi 


_u_ 

“l 


( f - 1)  dy 
1 1 


(20) 


All  of  these  parameters  can  be  evaluated  analytically  by  the 
use  of  equations  (5),  (6),  and  (8).  The  dimensionless  para- 
meters ( 6 d / 6 , 6m/6,  etc.),  are  shown  in  the  following  table 
for  the  case  of  compressible  flow  as  well  as  constant  density 
across  the  boundary  layer. 
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Dimensionless 

Compressible  flow 

Constant  density  across 

Parameter 

- . - - 

the  boundary  layer 

1 

-----  - - ...  i 

— 

Displacement 

” - 

m-n+mn(l-C) 

1 

thickness 

m-n+mn 

rFT  i 

! 

Momentum 

Cm2n 

1 

n 1 

thickness 

(m-n+mn )(2m-n+mn) 

(n+1 ) (n+2)  1 

| 

Velocity 

i i 

1 1 

thickness 

n+1  | 

n+1  f 

! 

Energy  Dissipation 

2Cm3n 

2Cn  \ 

thickness 

(m-n+mn ) (2m-n+mn ) ( 3m-n+mn ) 

(m+1 )(n+2)(n+3) 

Enthalpy 

thickness 

„/  1 Cm  \ 

n ^n+1  m-n+mn ' 

o i 

i 

I 

i 1 ■ ■■  I.M.—  

All  of  these  parameters  will  be  larger  for  the  case  of  variable 
density  than  constant  density  across  the  boundary  layer. 

IV  GUN  BARREL  FLOW 


The  physical  example  is  represented  schematically  as  shown 
in  Figure  1.  As  the  propellant  gases  expand  behind  the  pro- 
jectile, a boundary  layer  forms  at  the  breech  end  and  thickens 
as  the  flow  proceeds  downstream.  An  unusual  feature  of  the 
velocity  boundary  layer  is  that  it  disappears  as  the  projectile 
is  approached  since  all  fluid  at  the  base  of  the  projectile 
must  be  moving  at  projectile  velocity.  Mathematically,  this 
amounts  to  the  requirement  of  an  additional  boundary  condition 
at  a downstream  location.  The  numerical  techniques  applied  to 
most  boundary  layer  problems  call  for  the  specification  of 
profiles  at  the  upstream  end  of  the  flow  and  allow  a "marching" 
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al-ong  the  flow  direction.  For  the  usual  time-dependent 
boundary  layer  problem,  an  initial  condition  to  describe 
the  boundary  layer  flow  at  time  zero  and  boundary  conditions 
as  functions  of  time  are  required.  No  downstream  condition 
is  added.  The  complete  flow  characteristics  are  unknown  for 
gun  barrel  flows.  The  experimental  data  are  lacking  because 
of  the  moving  projectile.  This  obstacle  may  be  overcome 
if  one  takes  advantage  of  the  similarities  between  the  moving 
projectile  (small  mass)  and  a moving  shock  in  a shock  tube. 
Typical  velocity  profiles  are  also  shown  in  Figure  1. 

Fortunately,  the  final  governing  equations  (14)  and  (15) 
are  decoupled.  Therefore,  one  can  solve  these  equations  one 
at  a time.  The  path  of  the  velocity  characteristics  are 
described  by  a linear  ordinary  differential  equation 
(equation  15).  If  the  LaGrangian  approximation  of  interior 
ballistics  is  invoked  here  to  describe  the  free  stream 
velocity,  u15  the  integration  of  equation  (15)  yields 
equation  (17). 


u^mnt}  <16) 

n(m-l ) 

/ v \ / y \ 2m-n+mn 

{k)-{h)  <17> 

where  V = Velocity  of  the  projectile 
X = Location  of  the  projectile 

and  subscript  i indicates  characteri sti c location  initially. 
For  the  characteristics  located  at  the  base  of  the  projectile, 
x-j  and  X-j  will  be  identical. 

Since  the  location  of  characteri sties  at  any  time  are 
known  from  equation  (17),  the  nonlinear  ordinary  differential 
equation  (14)  is  integrated  numerically  by  iterative  technique 
due  to  lack  of  analytical  possibility.  The  following  steps 
are  in  order  (dS/dt  = f ) : 
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•All  values  are  known  at  point  i-1  (could  be  initial  or 
previous  time). 

•The  values  except  the  dependent  variable  are  known  at  point 
i.  An  approximate  value  can  be  obtained  by  computing  f at 
point  i-1,  i.e.,  fj_i  and  the  use  of  the  relationship, 

<$i  = <5 i-i  + fi_]  At. 

•Compute  now  fi  with  the  above  approximate  dependent  variable 
value.  Finally,  compute  <$i  = 6 -j  _ i + At  (f-j  + f -j  _ -j  ) / 2 . 

•Repeat  the  above  (last)  step  until  the  desired  degree  of 
convergency  is  reached.  The  convergence  to  a final  value 
is  very  rapid.  It  is  found  that  no  more  than  three  iterations 
are  required. 

This  procedure  is  repeated  for  each  time  step  and  for  all 
characteristics.  The  CPU  time  on  IBM  360/65  is  of  the  order 
of  5 seconds  for  11  characteristics  and  16  time  steps. 

Since  each  characteristic  curve  can  be  calculated  without 
reference  to  adjoining  characteristics,  the  accuracy  of  the 
calculations  for  velocity  boundary  layer  thickness,  and  in 
turn  various  boundary  layer  parameters  including  the  skin 
friction  coefficient  does  not  depend  on  the  number  of  character- 
istics chosen.  A large  number  of  characteristics  mean  the 
results  are  given  at  closer  intervals  in  the  stream-wise 
direction,  but  these  are  not  more  accurate  at  a calculated 
point.  A large  number  of  characteristics  will  lead  to  more 
computational  times. 

The  accuracy  of  the  integration  of  the  boundary  layer 
along  each  characteristic  depends  on  the  time  interval  chosen. 
Accuracy  and  computational  times  increase  with  decrease  in 
the  time  step.  Because  of  the  iteration  technique  used  in 
each  step,  very  small  times  are  not  required  for  good  accuracy. 

There  is  a singularity  problem  in  the  initiation  of 
calculations.  Since  5 and  V become  zero  initially  (t=o)  in 
the  important  terms  of  the  right  hand  side  of  equation  (14) 
or  in  the  following  equation,  the  integration  procedure 
described  above  fails. 
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di  f...  1 ^ f 2ACnU»  B 3p,  ,1  dV  1,  (D+2E)u,  Eu,  Bp.  1 

3t  " <m-'>L^T5T/'+n  ' p7-3T‘  c(FH¥  X>  — * L " liN  <18> 

' v ' 

This  is  due  to  equation  (16)  and  its  variations: 


l5n  „ - 1 dV  1 

at  An  Ul  v dt  ' x 

9x  £n  ui  = x 


(19) 

(20) 


To  initiate  the  computations  when  6 and  V become  zero, 
equation  (18)  is  approximated  to 


d6  _ 

at  ' 


(m-l  ) 


f 2ACnU_i.  _c_  au, 

L M2/l+n  ‘ ux  ft- 

' V 


(21) 


Taking  ux  = Ajt,  where  Ax  is  the  local  gas  acceleration  at 
position  x and  constant  for  the  first  time  step,  equation 
(21 ) becomes 


d<5  m- 1 
eft  " Cm2' 


(|)  (m-n+mn(l-C) 

,Ui_6.  / L 

' V ' 


(22) 


Let  4>  = i+n 

6 = ^ 

P = (lH)  (m-n+mn(l-C) ) 
q = ^lCn  A,1"*  ( i+4>) 
then,  equation  (22)  becomes 

§*i*-«*1-* 

The  solution  of  equation  (24)  is 


= _l_t2" 

2+p- 


(23) 


(24) 


(25) 
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or  6 = (m-D(2+n)Cnv(t)A1-n  t1"” 

Cmz(l-n)  + (m-1 ) (2+n)(m-n  + mn(l-C)) 


(26) 


This  is  used  to  initiate  the  computations  for  the  first  two 
time  steps  in  order  to  avoid  the  singularity  problem.  For 
the  projectile  initiated  characteristics  (i.e.,  at  the  pro- 
jectile base)  where  only  6 becomes  zero  but  not  V,  equation 
(14)  or  (18)  can  be  multiplied  by  62^  and  the  resulting 
equation  can  be  integrated  numerically  as  mentioned  above 
without  any  problem  of  singularity. 

The  following  data  is  assumed  for  the  XM140  (30mm) 
weapon  system  in  order  to  compute  the  characteristics  of 
velocity  boundary  layers: 

Barrel  Length  = 42.0  inches 

Chamber  Length  = 2.48  inches 

Reference  dynamic  viscosity  at  530°R  = 0.00003  Ibs/ft-sec 

m = 7,  n = 7 

time  step  = 0.0001  seconds 


Time 

Density 
lb  f/ft3 

Projectile  Velocity 
(ft/sec) 

Temperature, °R 

0.00000 

6.5 

0 

5200 

0.00040 

8.2 

- 

- 

0.00085 

5.0 

1600 

4200 

0.00130 

3.0 

2050 

3550 

0.00170 

2.3 

2200 

3200  i 

0.00215 

1.8 

2230 

I 

j 

0.00240 

i 

2240 

2800  | 

1 

1 

The  characteristics  are  shown  in  Figure  2.  The  initial 
positions  of  the  characteristics  are  chosen  so  that  the  region 
of  interest  is  covered.  Since  the  velocity  characteri sties 
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have  a steeper  slope  as  shown  in  Figure  2,  it  is  necessary  to 
use  characteristics  originating  both  in  the  original  pro- 
pellant chamber  and  also  at  the  projectile  base  during  the 
motion. 

The  velocity  boundary  layer  thickness,  6,  is  shown  in 
Figure  3.  The  boundary  layer  grows  not  only  with  time  but 
also  with  increase  in  x except  near  the  base  of  the  projectile. 
The  growth  is  steeper  at  the  projectile  base  than  near  the 
breech  end.  This  analysis  predicts  the  growth  and  also  the 
decay  continuously  without  any  separate  treatment.  The 
boundary  layers  are  much  thicker  than  predicted  by  Nordheim, 
Soodak,  and  Nordheim[7].  It  is  straightforward  to  compute 
any  other  parameter  of  the  boundary  layer  from  this  velocity 
boundary  layer  thickness  parameter. 

The  dimensionless  skin  friction  coefficient  is  shown  in 
Figure  4.  This  is  maximum  near  the  breech  end  and  decreases 
in  the  flow  direction.  The  skin  friction  coefficient  de- 
creases also  with  increase  in  time.  This  also  implies  that 
the  unsteady  flows  are  more  susceptible  to  viscous  effects 
than  steady  flows.  Of  course,  the  shear  stress  at  the  wall 
can  be  computed  by  the  use  of  skin  friction  coefficient  and 
equation  (9). 

The  distribution  of  shear  stresses  at  the  wall  in  PSI 
units  are  shown  in  Figure  5.  The  trend  is  somewhat  opposite 
to  that  of  the  skin  friction  coefficient,  i.e.,  the  shear 
stresses  are  maximum  near  the  projectile  base.  The  shear 
stresses  first  increase  and  then  decrease  with  increase  in 
time.  The  shear  stress  at  the  wall  will  be  maximum  at  the 
muzzle  end  of  the  gun  barrel.  Therefore,  the  gas  dynamic 
erosion  will  be  more  severe  at  the  muzzle  end  than  near  the 
origin  of  rifling.  Even  though  the  shear  stresses  are  of 
the  order  of  100  psi  and  may  not  create  significant  erosion 
on  high  strength  material  such  as  steel,  these  shear  stresses 
can  easily  wipe-off  any  incipient  melting  of  the  bore  surface 
materi al . 
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INITIAL  CONFIGURATION 
OF  SOLID  PROPELLANT 


FIGURE  I 


SCHEMATIC  OF  GUN  BARREL  AND  ASSOCIATED 
VELOCITY  PROFILES 
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FIG.  2 PATH  OF  VELOCITY  CHARACTERISTICS  AND  PROJECTILE 
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ABSTRACT 

In  a recent  development  of  a steel  cartridge  case,  high  extraction 
forces  were  encountered  that  were  caused  by  the  sticking  of  the  case  to 
the  chamber.  This  sticking  condition  arose  because  a steel  cartridge  case 
has  less  recovery  than  a brass  case.  This  reduced  recovery  can  be  attri- 
buted to  the  modulus  of  elasticity  of  steel  that  is  much  higher  than  the 
modulus  of  elasticity  of  brass.  A relatively  simple  nonlinear  elasto- 
plastic  analysis  has  been  developed  to  parametrically  analyze  the  cartridge 
case/chamber  interaction  under  actual  firing  conditions  when  the  cartridge 
case  is  loaded  near  the  maximum  material -carrying  capacity.  In  contrast 
to  the  usual  assumption  that  the  chamber  is  rigid,  the  chamber  is  con- 
sidered to  be  deformable  in  this  investigation.  The  interaction  of  the 
steel  cartridge  and  the  chamber  is  studied  parametrically.  With  the  use 
of  the  analysis  proposed  in  this  paper,  a set  of  design  parameters  can  be 
found  to  ensure  uniform  and  acceptable  performance  of  a steel  cartridge  case. 

INTRODUCTION 

Cartridge  cases  for  small -arms  ammunition  have  been  traditionally 
designed  for  brass.  However,  because  of  the  limited  natural  supply  of 
brass  and  its  predominant  use  in  small  arms  ammunition,  a strategic  signi- 
ficance has  been  attached  to  it.  With  the  increasing  emphasis  of  fire- 
power and  inadequate  domestic  supply,  the  usage  of  brass  during  a major  war 
could  become  critical.  Hence,  the  use  of  alternate  materials  for  cartridge 
cases  of  small-arms  ammunition  is  very  important.  Aluminum  and  steel  are 
considered  to  be  more  economical  materials  than  brass  for  this  application. 
The  major  difficulties  in  the  development  of  aluminum  cartridge  cases  are 
those  of  the  so-called  "burn-through"  problems.  Existing  literature  in- 
dicates that  the  sidewall  of  an  aluminum  cartridge  case  that  split  during 
firing  caused  serious  erosion  of  the  case  head  and  the  chamber  of  a gun. 
Serious  erosion  damage  in  the  Ml 6 Rifle  chamber  due  to  aluminum  case  splits 
has  been  observed  [Ref  1].  Hence,  from  the  economical  point  of  view,  the 
development  of  steel-cased  cartridges  is  of  great  importance  at  the  present 
time.  In  the  recent  development  of  steel  cartridge  cases,  the  problem  has 
arisen  in  which  the  cartridge  case  becomes  stuck  in  the  chamber,  resulting 
in  high  extraction  loads.  The  main  cause  of  the  "sticking"  problem  is 
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that  a steel  cartridge  case  has  less  recovery  than  a brass  case  since 
the  modulus  of  elasticity  of  steel  is  much  higher  than  the  modulus  of 
elasticity  of  brass. 

The  objective  of  this  investigation  was  to  develop  a relatively 
simple  nonlinear  elastoplastic  method  for  analyzing  and  designing  a 
steel -cased  cartridge  to  remedy  the  sticking  problem.  The  interaction 
of  a steel-cased  cartridge  and  the  chamber  of  a gun  will  be  studied 
parametrically.  A set  of  design  parameters  such  as  cartridge  case  ma- 
terial properties,  chamber  pressure,  cartridge  configuration,  the 
initial  clearance  between  a case  and  a chamber,  and  the  configuration 
of  the  chamber  of  a gun,  will  be  established  to  ensure  uniform  and 
acceptable  performance  of  a steel-cased  cartridge.  In  contrast  to  the 
usual  assumption  that  the  chamber  is  rigid,  the  chamber  is  considered 
to  be  deformable  in  this  investigation. 

THEORY 

An  inelastic  theory  for  a cartridge  case  is  developed  on  the  basis 
of  the  following  assumptions: 

a.  The  material  is  homogeneous  and  isotropic. 

b.  The  inelastic  deformation  is  time-independent. 

c.  Hencky's  stress-strain  relation  is  valid. 

d.  The  material  is  assumed  to  be  compressible. 

e.  Von  Mises'  yield  criterion  is  valid. 

The  use  of  cylindrical  coordinates  (r,6,z)  is  convenient  where  z is 
coincident  with  the  longitudinal  axis  of  the  case.  For  any  point  in  a 
cartridge,  the  nonvanishing  stress  components  are  ar,  oq,  and  az  while 
the  nonvanishing  strain  components  are  cr,  eg,  and  ez. 

Elastoplastic  Solution  of  Case-Chamber  Problem 

A longitudinal  cross  section  of  a cartridge  case  is  shown  in  Figure  1. 
The  cartridge  case  is  divided  into  N rings  with  z0=£,  zx,  z2,...zn=L. 

Let  F0  be  the  bullet-pulling  force.  The  load  acting  on  the  base  of  a 
bullet  is  given  by 

F = it  r 2 Pg  (1) 

where  Pg  is  propellant  gas  pressure. 

The  stress  components  acting  on  a ring  with  radius  r are 
Pr 

Oa  = (2) 

6 tCosB 

°r  = 57  00  = ' (3) 
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(4) 


Pr 

°z  ' 2tCosg 

= P(rf-rl) 

2rtCos3 


if  F < F0 


if  F > F0 


(5) 


Note  that  P = Pq-P j where  Pj  is  the  interface  pressure  between  a 
cartridge  case  and  the  chamber. 

Cartridge  Case  Loaded  Into  Inelastic  Region: 

In  the  development  of  an  elastoplastic  solution,  a loading  function 
must  be  specified  in  which  the  plastic  deformation  and  the  subsequential 
yield  condition  are  defined.  For  linear  strain-hardening  materials,  the 
loading  fucntion  of  Hill  [Ref  2]  can  be  written: 

a = (l-a)cxyC  + aEce  (6) 

where  aEc  is  the  slope  of  the  straight  line  in  the  plastic  region  and  a 
may  be  considered  as  a strain-hardening  factor  for  the  material,  oyc 
is  the  yield  stress  of  the  cartridge  case  material  in  tension,  5 and 
e are  effective  stress  and  effective  strain,  respectively,  which  are 
defined  as 

5 = J-  t(or~°e)z  + (°e~az)Z  + (az~ar)2t2  (7) 


and 

e = 'y  [( er-ee )2  + (ee-ez)2  + (ez_er)2^ 
since  e = ee  + iP,  equation  (6)  can  be  reduced  to  the  relation  [Ref  3] 


o 


°y  c 


+ aEc  -P 
1-0 


or 


eP  = 


1-g 

«EC 


(o  -oyc) 


(9) 

(10) 


With  the  use  of  Hencky's  stress-strain  relation  [Ref  4], 
_P 

i = 7 - 

a 


'U 


l h±e9 


(ID 
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where  Sjj  is  a deviatoric  stress  tensor.  Hence, 

4 - I ? Se  ■ \ |P[2o0-or-a2]  (12) 

Substituting  eq.  (10)  into  eq.  (12)  yields 

= d )^a0"ar"az)  (13) 

With  the  use  of  the  strain-deflection  relation 

. - 04) 

e0  r 

where  uc  is  the  radial  displacement  of  the  cartridge  case  at  radius 
r,  C0  is  the  circumferential  strain  which  is  the  sum  of  the  elastic 
part  and  the  inelastic  part,  i.e., 

£0  ■ (15) 

Hence,  equation  (14)  can  be  written 

uc  = r(£g  + Eg ) (16) 

From  the  theory  of  elasticity,  one  obtains  the  following  equation 

e6  = ‘ vc(ar  + az)]  (17) 

Substituting  eqs.  (13)  and  (17)  into  eq.  (16),  then 

uc  = K^[ae-vc(ar+az)] (]  -^e-)(2a0-ar-az)}  (18) 

Let  e be  the  initial  clearance  between  cartridge  case  and  chamber. 

Then  the  displacement  of  the  chamber  at  the  inner  surface,  ub»  can 
be  written 

ub  = uc  - e (19) 

If  the  chamber  is  assumed  to  be  a thick-walled  cylinder  subjected  to 
internal  pressure  Pj,  then  one  has  [Ref  5] 

ub  = uc  - e = ^ Cpiff  + vb]  (20) 
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Hence, 


1 „rb2+a2  x , t 

+ vbJ 

if  uc  > e 

(21) 

Pj  = 0 

if  uc  < e 

(22) 

Note  that: 

Pg  = P + Pi 

if  uc  > e 

(23) 

Pg  = P 

if  uc  - e 

(24) 

Cartridge  Case  Loaded  In  Elastic  Region: 

If  the  cartridge  case  is  loaded  in  the  elastic  region,  i.e., 
a < oyC,  then  the  following  stress-strain  and  strain-displacement  relations 
will  be  used 

er  = "jf  C^r  “ vc  (cfe  + crz ) ] 

£0  = [a©  - vc  (ar  + az)]  (25) 

ez  = -—  [crz  - vc  (ar  + oq)] 

Lc 

and 

uc  = re6  = -rP  [oe  - vc  (or  + az)]  (26) 

tc 

During  the  unloading  process,  the  behavior  of  the  cartridge  case 
and  the  chamber  is  considered  to  be  elastic.  The  dimensions  used  are 
the  dimensions  of  the  case  and  chamber  computed  at  the  peak  pressure. 

A negative  pressure  is  applied  to  the  case  during  the  unloading  process. 

The  result  of  unloading  is  then  superposed  on  the  result  at  the  peak 
pressure  and,  hence,  the  interference  pressure  and  the  extraction  force 
can  be  determined. 
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COMPUTATIONAL  METHOD 


Loading 

Since  a large  displacement  of  the  cartridge  case  may  be  involved 
in  the  present  case-chamber  interface  problem,  the  true  displacement  of 
the  cartridge  case  at  the  peak  pressure,  Pmax,  would  not  be  known  at 
the  beginning  of  the  loading  process.  Hence,  an  incremental  loading 
procedure  will  be  utilized  to  obtain  solutions  in  this  investigation. 

For  each  increment  of  pressure,  the  calculation  procedure  can  be 
stated  as  follows: 


Step  1 

Specify  an  increment  of  pressure  AP-j  acting  on  the  case. 
Pi  = Pi.,  + &P, 

then : 

Step  2 

Calculate  oq,  ar,  and  oz  by  using  eqs.  (2),  (3),  and  (4) 

or  (5) 

Step  3 

Calculate  effective  stress,  a,  by  using  eq.  (7).  Note  that 
if  a < ayC,  case  loaded  in  elastic  region 

if  o > Oy c,  case  loaded  into  inelastic  region 

Step  4 

If  5 < OyC,  calculate  circumferential  strain 

£0  = F"  ~ vc(ar  + oz)] 

tc 

Step  5 

If  a > 0yC , calculate  effective  plastic  strain 
■ £ (5  - eye) 

Step  6 

Calculate  plastic  component  of  circumferential  strain 

e§  = 1 ^ C2ae  - ar  - az] 
^ a 

Step  7 

Compute  radial  displacement  of  the  case 

uc  = r{-j^  [oe  - vc(or  + cz)]  + eg  } 
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Step  8 Compute  bore  displacement  of  the  chamber 
ub  = uc  - e if  uc  > e 
=0  if  uc  < e 

where  e is  the  initial  clearance  between  case  and  chamber 


Step  10  Compute  propellant  gas  pressure 

Pg  = Pi  + Pj 

pg  < pmax5  let  r = r0  + uc,  then  refer  back  to  Step  1. 

All  computations  to  continue. 

Step  11  If  Pg  = Pmax,  then  let 

Pi  = Pi 
Uc  = uc 
ub  " Ub 

rmax  ro  + uc 

Hence,  Pj,  uc,  U5,  and  rmax  are  interface  pressure  between  the 
case  and  the  chamber,  displacement  of  the  case,  displacement  of 
the  chamber  at  bore,  and  radius  of  the  case  at  the  peak  pressure, 
respectively. 
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Unloading 


Step  1 Specify  -AP-j,  then  calculate  Pi  by 
Pi  ■ Pi^-APi 

Step  2 Compute  ap,  a@,  and  az 

Step  3 Compute 

£0  = [ae  - vc(ar  + az)] 

Ec 

Step  4 Compute  case  radial  displacement 

«cs[  [a6  “ vc(ar  + oz)] 


Step  5 Compute  interface  pressure 


Pi  - 


uc  Eb 


a-l 


b +a 

bZ~OL 


2 

r + vb  ] 


Step  6 Applying  the  superposition  principle,  the  interface  pressure 

between  case  and  chamber  and  the  case  and  chamber  displacements 
at  unloading  can  be  computed  by  the  relations  of 


p = Pj  + Pj 

if  |uc| 

< % 

= 0 

if  |uc| 

- % 

uc  ■ uc  + uc 


% ■ % + % 

if  |up| 

- % 

= 0 

if  1 uc 1 

> ub 

where  P is  the  interface  pressure  during  unloading 
u is  the  case  displacement  during  unloading 
and  ub  is  the  bore  displacement  of  chamber  during  unloading 
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Step  7 Compute 

Pg  = Pi  + P! 

if  Pg  + Pmax  >0,  then  let  r = rmax  + uc,  refer  back  to  Step  1. 
All  computations  to  continue. 

Step  8 If  Pg  + Pg  * 0,  then  compute  extraction  force.  Mote  that  when 
the  unloading  process  is  completed,  the  interface  pressure  is 
automatically  computed  at  Step  6 which  is  used  to  compute  ex- 
traction force  with  assigned  friction  coefficient  between  the 
case  and  the  chamber. 


ANALYSIS  OF  RESULTS 

The  steel  cartridge  case  considered  in  this  investigation  is  shown 
in  Figure  1.  The  outer  diameter  of  the  chamber  is  assumed  to  be  one  inch. 
To  obtain  numerical  solutions,  the  cartridge  case  is  divided  into  32  rings. 
The  thickness;  the  initial  clearance  between  case  and  chamber;  material 
properties  such  as  yield  strength,  modulus  of  elasticity,  strain-hardening 
factors,  etc.,  for  each  ring;  and  the  peak  pressure  are  considered  to  be 
independent  design  parameters.  The  effects  of  those  parameters  on  the 
extraction  forces  are  analyzed.  A steel  cartridge  case  is  considered  in 
this  investigation,  and  since  the  modulus  of  elasticity  of  steel  does  not 
vary  much,  it  will  be  considered  as  a constant,  E = 30xl06  psi,  for  each 
ring. 

Effect  of  the  Peak  Pressure 


The  magnitude  of  the  peak  pressure  is  a very  important  factor  in 
causing  the  sticking  problem  of  the  steel  cartridge  case.  The  effect  of 
the  peak  pressure  on  the  resulting  extraction  force  is  shown  in  Figure  2. 

In  this  figure,  note  that  the  incidence  of  cases  that  stick  to  the  chamber 
can  be  significantly  reduced  or  eliminated  if  the  peak  pressure  can  be 
reduced  to  a certain  level.  A reduction  of  peak  pressure  (all  other  factors 
held  constant)  will  reduce  the  round  impulse  and  velocity.  One  method  of 
maintaining  the  round  impulse  and  the  velocity  while  the  peak  pressure  is 
being  reduced  is  to  cause  the  Pressure-Time  (P-T)  curve  to  become  flatter. 

Effect  of  Yield  Strength  of  Case  Material 

The  material  properties  such  as  yield  strength  of  a steel -cased 
cartridge  also  play  a very  important  role  in  the  sticking  problem.  The 
effect  of  yield  strength  of  a case  material  on  the  extraction  force  is 
shown  in  Figure  3.  The  extraction  force  can  be  reduced  if  a case  is  made 
of  a higher  strength  material,  as  indicated  in  Figure  3.  However,  the  in- 
creasing of  the  strength  of  material  is  limited  by  other  material  properties 
such  as  hardness  and  elongation.  If  the  hardness  of  a cartridge  case  is 
increased,  then  the  ductility  of  material  will  be  reduced  and  hence  the 
incidence  of  a split  case  will  be  increased. 
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Effect  of  Initial  Clearance  Between  Cartridge  Case  and  Chamber 


The  relation  of  clearance  between  cartridge  case  and  chamber,  and 
the  extraction  force  is  shown  graphically  in  Figure  4.  Note  that  the 
increasing  of  an  initial  clearance  will  reduce  the  force  required  to 
extract  a case  that  is  sticking  to  the  chamber.  However,  this  increas- 
ing of  the  initial  clearance  may  increase  the  incidence  of  malfunctions 
in  feeding. 


CONCLUSIONS 


A simple  nonlinear  solution  technique  has  been  presented  for 
solving  problems  in  the  interaction  of  a steel  cartridge  case  and  a 
chamber.  Both  nonlinear  material  response  and  geometric  nonlinearity 
have  been  taken  into  consideration  in  this  investigation.  Material 
nonlinearity  has  been  taken  into  account  by  use  of  the  theory  of  plas- 
ticity and  Hencky's  constitutive  equations.  An  incremental  loading 
procedure  has  been  used  in  consideration  of  large  deformation  of  a 
cartridge  case. 

Guidance  for  the  selection  of  design  variables  such  as  yield 
strength  of  cartridge  case  material,  peak  chamber  pressure,  and  initial 
clearance  between  cartridge  case  and  chamber  is  presented  graphically 
in  this  paper. 
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Fig.  1 Longitudinal  Cross  Section  of  A Cartridge  Case 


Fig.  2 Effect  of  the  Peak  Pressure  On  The  Extraction  Force 
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Fig.  3 Effect  of  the  Yield  Strength  of  the  Case  Material 

On  The  Extraction  Force 
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CLEARANCE  (in.) 


Fig.  4 Effect  of  the  Clearance  Between  Case  and  Chamber  On  The  Extraction  Force 


NINE -POINT  DIFFERENCE  SOLUTIONS  FOR  POISSON'S  EQUATION 


J.  Barkley  Rosser 
Professor  of  Mathematics 
Mathematics  Research  Center 
University  of  Wisconsin-Madison 
Madison,  Wisconsin  53706 


Abstract.  It  is  shown  that  "stencils"  exist  for  the  sixth 
order  solution  of  Poisson's  equation  by  use  of  a nine-point 
difference  approximation.  This  enables  one  to  get  more 
accurate  approximations  for  the  solution  with  less  labor. 
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NINE-POINT  DIFFERENCE  SOLUTIONS  FOR  POISSON'S  EQUATION 

J.  Barkley  Rosser 

Introduction.  Suppose  we  wish  to  approximate  the  solution  of 

(1.1)  V2u(x,  y)  = f(x,  y) 

inside  some  region,  given  values  around  the  boundary.  A standard 

approach  is  to  introduce  a "stencil",  say 


0 

1 

0 

1 

-4 

1 

0 

1 

0 

u(x,  y)  . 


The  significance  of  this  is  that  if  a coefficient  in  the  stencil  is  m units 
above  the  horizontal  center  line  end  n units  to  the  right  of  the  vertical 
center  line  (m  and/or  n may  be  negative),  one  forms  the  product 
of  the  coefficient  with  u(x  + nh,  y *mk);  the  entire  stencil  denotes  the 
sum  of  these  products.  Thus  the  stencil  shown  above  denotes 

u(x  + h,  y)  + u(x  - h,  y)  + u(x,  y + k)  + u(x,  y - k)  - 4u(x,  y) . 

To  approximate  the  u(x,  y)  which  solves  (l.  l)  it  is  traditional 
to  choose  k s h.  Also,  to  save  space,  we  write 


Sponsored  by  the  United  States  Army  under  Contract  No.  DA- 31 -1 24-ARQ-D-462. 
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(1.2) 


u(x,  y)  . 


A5u(x,  y)  = 


0 

1 

0 

1 

-4 

1 

0 

1 

0 

This  is  called  the  five-point  stencil,. 

If  u(x,  y)  is  reasonably  smooth,  we  have 

(1.3)  A|_u(x,  y)  & h2'72u(x,  y) 

4 

to  order  h . So,  if  h is  reasonably  small, 

(1.4)  Aj.u(x,  y)  s h2f(x,  y) 

is  a good  approximation  to  (1.  l).  Let  us  choose  a suitable  (xQ,  y ) 
and  define 

(1.5)  um>n  = u(x0  +rah’y0 +nh>  • 

4 

Then  to  approximation  h , we  conclude 

2 

(1.6)  A.u  s h f(xrt  + mh,  y.  + nh)  . 

d m,  n v 0 ’ 0 

The  equations  (l.6)  are  a set  of  linear  equations,  which  have  a unique 

exact  solution,  u . If  we  solve  these  linear  equations,  we  will 
’ m,n  ’ 

get  quantities  u such  that  u differs  from  u by  order  h 
m,n  m,  n m,  n 

(assuming  smooth  boundary  conditions). 
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This  is  the  basis  for  several  schemes  for  computing  numerical 

approximations  for  u(x,  y)  at  the  "grid  points"  (x^  + mh,  + nh). 

2 

However,  since  the  accuracy  is  only  to  order  h , it  is  not  possible 
to  get  very  high  accuracy.  Even  moderate  accuracy  requires  solving  a 
very  large  number  of  simultaneous  equations. 

If  f(x,  y)  = 0,  one  can  get  higher  order  approximations  by  use  of 
a nine-point  stencil,  and  so  improve  the  situation.  The  nine-point 
stencil  is  given  by 


(1-7) 


^9u(x,  y) 


1 

4 

i 

4 

-20 

4 

1 

4 

1 

u(x,  y)  . 


In  general,  one  has 

(1.8)  A9u(x,  y)  s 6h272u(x,  y) 

4 

only  to  order  h . So  if  one  solves 

(1.9)  A9Um,n  = 6h2f<x0  + mh’y0  + nh^  ’ 

2 

one  will  still  usually  only  get  an  approximation  to  order  h . However, 
if  one  solves 

(1.10)  A u = 0 , 

9 m,n  ’ 
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one  will  get  an  approximation  to  order  h for  the  solution  of 

(1- 11)  ^2u(x,  y)  = 0 , 

PROVIDED  one  has  smooth  enough  boundary  conditions. 

The  object  of  the  present  paper  is  to  present  methods  for  solving 
(l.  l)  by  a nine-point  stencil  to  order  h^  even  when  f(x,  y)  is  not 
identically  zero.  We  should  warn  that  these  methods  will  fail  unless 
the  boundary  conditions  and  f(x,  y)  are  smooth.  In  particular,  f(x,  y) 
should  have  bounded  derivatives  up  to  order  six  for  the  methods  of  this 
paper  to  succeed. 

For  completeness,  we  repeat  certain  material  from  Rosser  [ 1 ] . 

2.  A fourth  order  method.  In  Section  1,  we  contemplated  dividing 
our  region  into  squares.  For  some  types  of  regions,  it  would  be 
convenient  to  divide  the  region  into  rectangles.  It  is  widely  believed 
that  difference  methods  cannot  be  constructed  to  give  approximations  of 
order  greater  than  two  unless  the  region  is  divided  into  squares.  This 
is  not  so.  We  will  explain  a method  that  gives  approximations  of  order 
four  if  rectangles  of  sides  h and  k are  used. 

Let  us  temporarily  set 

(2.1)  u = u(xQ  + mh,  yQ  + nk)  . 

That  is,  we  use  rectangles  whose  corners  are  the  grid  points 
(xQ  + mh,  yQ  + nk). 
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Define 


(2.2) 


12h2k2 
h2  + k2 


(2.3) 


2 2 
10k  - 2h 

l2  .2 
h + k 


(2.4) 


10h2  - 2k2 
h2+k2 


We  have 


(2.5) 

(2.6) 


(b  + 2)h2  = A = (c  + 2)k2  , 


b + c = 8 . 


Define  a modified  nine-point  stencil 


(2.7) 


A9u(x,  y) 


1 

c 

1 

b 

-20 

b 

1 

c 

1 

here  motion  of  one  unit  in  the  y-direction  in  the  stencil  is  supposed 
to  induce  a change  of  k in  y,  as  in  our  original  definition. 

If  u(x,  y)  is  smooth,  we  have  to  order  h^  + k^ 

2 

(2.  8)  A*u(x,  y)  x A72u(x,  y)  + uxxxx(x>  v)  + h2k2uxxyy(x>  ^ 


k2A  . 

+ IF  uyyyy  x’  y)  ' 
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By  (l . l)  we  have 


u (x,  y)  + u (x,  y)  = f (x,  y) 

xxxx  xxyy  ’ xxv  ’ 

u (x,  y)  + u (x,  y)  = f (x,  y)  . 

xxyy  ’ yyyy  yy 

2 c 

If  we  multiply  the  first  of  these  by  h A/l2  and  the  second  by  k‘ 
and  add,  we  see  by  (2.2)  that  we  can  write  (2.8)  as 

2 2 

(2.9)  Aju(x,  y)  a Af(x,  y)  + ^(x,  y)  + fyy(x,  y)  . 


Observe  that 


h2f  (x,  y)  e*  f(x  + h , y)  + f(x  - h , y)  - 2f(x,  y) 

AA 

k2fyy(x,  y)  a f(x,  y + k)  + f(x,  y - k)  -2f(x,  y)  . 

6 6 

Thus  we  conclude  finally  that  to  order  h + k 


(2.  10) 


* h\2 

Aqu(x,  y)  a 


u2  , 2 

h + k 


0 

1 

0 

1 

CO 

1 

0 

1 

0 

f(x,  y) 


So,  if  we  solve 


A/l  2 
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(2.11) 


we  will  get  u that  differ  from  u by  the  order  h + k . 
m,n  m,  n 

This  gives  a method  of  order  four  for  rectangular  grid  elements. 


3.  A sixth  order  method.  If  there  is  a sixth  order  method  that 
permits  the  use  of  rectangular  grid  elements,  we  have  no  knowledge  of 
it.  So  we  return  to  square  grid  elements,  adopting  again  the  notations 
of  Section  1 . 

g 

If  (l.l)  holds  and  u(x,  y)  is  smooth  enough,  then  to  order  h 

? .4  ,6  6 

(3.1)  A9u(x,y)  - 6h  f(x,  y)  + — V f(x,  y)  + ^ \7  f(x,  y)  + ^ fxxyy(x’  ^ ■ 

This  does  not  agree  exactly  with  equation  (20.  57)  on  p.  194  of 
Forsythe  and  Wasow  [ 2] . However,  they  claim  that  their  (20. 57)  is  copied 
from  another  reference,  but  they  made  a mistake  in  copying.  Our  (3.  l) 
agrees  with  the  formula  from  which  (20.  57)  was  supposed  to  be  copied. 

We  note  that 
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to  order  h , if  f(x,  y)  is  sufficiently  smooth.  So  we  may  replace 
(3.1)  by 


(3.3)  Y(x,y)s^ 

This  formula  appears  in  Collatz  [ 3]  as  one  of  the  stencils  in 
Table  VI  on  p.  543. 

There  are  occasional  circumstances  in  which  this  would  be  quite 
adequate;  for  instance  if  f(x,  y)  is  a harmonic  function.  However,  in 
general  we  must  do  better. 

For  computational  purposes,  when  we  replace  the  u by  u 

m, n m,n 

and  attempt  to  solve,  it  does  not  matter  what  is  on  the  right  side  of  our 
equations,  as  long  as  we  maintain  the  same  form  on  the  left  side.  So 
there  is  no  reason  why  we  must  restrict  the  right  side  of  (3.  3)  to  a 
nine-point  stencil. 
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As  we  will  have  a solution  to  order  h , we  need  not  take  h 
particularly  small.  Useful  results  have  been  obtained  with  h = L/6, 
where  L is  a basic  dimension  of  the  entire  figure.  With  h = L/lO 
accurate  results  should  result,  and  with  h = L/20  quite  high  accuracy 
should  result.  For  a square  region,  this  requires  at  most  441  grid  points, 
including  the  boundary.  It  would  not  overload  the  memory  to  compute 
f(x,  y)  in  advance  at  every  grid  point,  store  these  values,  and  call  such 
as  are  needed  for  each  application  of  (3.  3)  or  its  replacement.  In  fact, 
it  would  be  an  efficient  scheme  of  computation.  So  we  use  a larger 
stencil  on  the  right  of  (3.  3). 

We  note  that 


(3. 4)  h2Wx’  y)  = - f(x  + 2h>  y) 

+ 3 f(x  + h,  y)  - | f(x,  y) 

+ ^ f(x  - h,  y)  - f(x  - 2h,  y) 

to  order  h^ . By  using  this  and  the  corresponding  relation  for 
in  (3.  3),  we  obtain 


f 

yy 


(x,y) 
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This  will  serve  very  satisfactorily  except  for  points  adjacent 
to  the  boundary.  For  these,  values  of  f(x,  y)  at  points  outside  the  region 
would  be  required.  If  such  are  available,  there  would  be  no  difficulty. 
However,  they  might  not  be  available. 

There  is  of  course  the  off-center  difference  approximation 

(3.  6)  h2fxx(x,  y)  a “ f(x  - h,  y)  - |f(x,  Y)  ~ \ f(x  + h,  y)  + ^ f(x  + 2h,  y) 

- 2 f (x  + 3h,  y)  + ^f(x  + 4h,  y)  , 

correct  to  order  h^.  Using  it,  and  the  corresponding  relation  for  y, 
we  could  derive 
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g 

which  is  valid  to  order  h ; on  the  right  of  (3.7)  the  "origin"  of  the  stencil 
is  the  square  which  contains  566. 

This  certainly  brings  a method  of  order  h^  within  reach.  One 
wonders  if  there  could  be  a better  stencil  than  that  on  the  right  of  (3.7). 
Perhaps  there  is  not,  but  we  will  investigate  what  is  available. 
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exist  for  S = 4.  We  shall  show  that  they  do  not  exist  for  S < 4.  For 


S = 4,  there  are  many  sets  of  a , and  we  shall  derive  the  general 
’ m,  n’ 


form. 


Because  the  right  side  of  (4.  l)  is  invariant  under  interchange  of 


x and  y,  if  a satisfy  (4.i),  then  so  would  a , where  we  take 

■*  ' n ' ' 7 m 7 


Then  so  would 


a = a 
m,  n n,m 


a = ~(a  + a ) . 

m,  n 2 m,  n m,  n 


So  we  lose  no  generality  in  assuming 


(4.2) 


a = a 
m,  n n,  m 


If  f(x,  y)  is  smooth  enough  to  have  a double  Taylor  series  out 
to  order  h , then 


S S 


r K 


(4.3)  Z Z a f(x  + mh,  y + nh)  s h Y — r~  D D f(x,  y) 
LJ  , m,  n ’ un  un  s (r  - s)  x y 

m = -l  n=-l  ’ r=0  s = 0 v 


to  within  terms  of  order  h , where  D and  D are  partial  derivatives 

’ x y 

with  respect  to  x and  y respectively,  and 


S S 


(4.4) 


l l -r 

m=-l  n=-l 


-s  s 
n a 


Because  of  (4.2),  we  have 


(4.5) 


K = K 
rs  sr 


Define 


(4.6) 


A = Z 

m <—* 


n=  -l 
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By  (4.  4) 


S 

(4.7)  K0=  l m\. 

m=  -1 

By  (4.3),  if  we  are  to  satisfy  (4.1),  we  must  have 


(4.8) 

Koo 

= 6 

(4.9) 

So 

= 0 

(4.  10) 

o 

(Nl 

= 1 

(4. 11) 

koa 

30 

= 0 

(4.12) 

KAn 

40 

_ 2 
5 

(4.13) 

50 

= 0 

By  (4.7),  this  is  a set  of  six  simultaneous  linear  equations  for 
the  A . If  S < 4,  they  have  no  solution.  So  we  take  S = 4,  for 
which  we  observed  earlier  that  there  is  a solution,  and  proceed.  The 
equations  (4.8)  through  (4.  l 3)  have  the  unique  solution 


A = — A = A = — 

-1  20’  A0  40  ’ A1  10’ 

7 3 1 

A = — A = - — A = — 

2 20  ’ 3 , 20  ’ 4 40  ' 


Analogously,  if  we  write 


(4.  14) 


B 

m 


z 


n=  -1 


na 

m,  n 


then  to  satisfy  (4.  l)  we  must  have 


(4.15) 


Z 


m=-l 


mr  *B  = K = 0 
m rl 


(1  < r < 5)  . 
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This  set  of  5 equations  has  the  one  fold  multiplicity  of  solutions 

B-i'-5B0'  V-2V  V2V  VV  VK 
We  write  also 


(4.16) 


C = X n2a 
m ^ , m,n 

n = -l  ’ 


To  satisfy  (4.1)  we  must  have 


(4.17) 


y.  c ’=  k_  = i 

u m 22 


ms-1 


(4.18) 


(4.19) 


ms-1 


mCm  = K32 


S 

2 m C = K4  = -~ 
m ^ , m 42  15 

m = -l 


(4. 20) 


A m C = K _ * 0 . 
u m 52 


m = -l 


This  set  of  4 equations  has  the  two  fold  multiplicity  of  solutions 
C 


_ li.lr  , lr  r , 11^2  5 

'-1  " 60  4 C0  + 4 °4'  °1  " 30  2 % 2 °4  ’ 


C2 


i-L  jl  r*  -L  RP  P"  JLL  _ X r*  JLS,  f* 

15  0 4’  3 60  4 0 4 4 

To  satisfy  (4.  l)  it  is  sufficient  as  well  as  necessary  to  satisfy 
equations  (4.  8)  - (4. 1 3),  (4.15),  and  (4. 17)  - (4.  20).  This  we  have 
accomplished,  and  with  three  parameters,  Bq,  C^,  and  C^,  at  our 
disposal. 
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Given  values  of  the  A , B , and  C , we  have  yet  to  determine 

m nr  m 


the  a . By  (4.2)  and  (4.6),  we  have 


(4.21) 


a n - A - a 

m,  0 m m,  -1  — ' m,  n 

n=  1 


4 

X a 


Except  for  m = 0,  this  expresses  a . (and  hence  ) in  terms 

’ m,0  0;nr 

of  a with  both  r + 0 and  s + 0.  Using  (4.  2)  with  (4.  2l)  gives 
r,  s 

(4.22)  a =A0-{A  -a  , f.  )-  l {An-a  ,-fa 

’ r=l  ’ n=l  ’ r=l  ’ 

So  also  a^  . is  expressed  in  terms  of  a with  both  r + 0 and  s + 0. 
0,0  r,  s 

If  we  add  and  subtract  (4. 14)  and  (4.  16)  we  will  get 

4 

(4.23)  a,  = ”•  {C  - B Y.  n(n  - l)a  } 

' -1 , m 2 m m m.n 

n=  2 


} 


(4.24) 


Except  for 


a = -7  {C  + B - Y n(n  + l)a  ) ■ 
l,m  2 m m m.n 

n=  2 


m = -1,  0,  and  1,  (4.23)  expresses  a_ 

~ 1 ? 


m 


(and  hence  a ) in  terms  of  a with  r > 2 and  s > 2.  If  we 
m,  -1  r,  s “ ~ 


take  m = - 1 in  (4.  23),  and  make  another  use  of  (4.  23),  we  get 

4 4 

2 

n=  2 " " r=  2 


(4-25)  r,,-!  = y <C_!  - B-!  - i l n(n  - !)[  cn  - B„  - Z /(r  - l)a  ]} 

* ? IT  J * 


If  we  take  m = 1 in  (4.  23),  and  make  a use  of  (4.  24),  we  get 

4 4 

Z 

n=  2 " " r-  2 


(4.26)  a = ~ (C  “ B - ^ Z ,n(n-l)[C  +B  - Z/(r+l)a  ]} 
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So,  except  for  m = 0,  we  have  a . ^ (and  hence  a J in  terms 
1 - 1 , m m,  -1 

of  a with  r > 2 and  s > 2.  Using  these  in  (4.  2l)  gives  also 


r,  s 

a „ in  terms  of  a with  r > 2 and  s > 2. 

-1,0  r,s 

Except  for  m = 0 and  1,  (4. 24)  or  (4. 26)  expresses  a in 

1 } m 

terms  of  a with  r > 2,  and  s > 2.  If  we  take  m = 1 in  (4.  24), 

r,  s 

and  make  another  use  of  (4.  24),  we  get 


4 


4 


(4.27)  a1>1  = l{C1+B1-'J  n(n+l)[Cn  + Bn- 

If  we  use  this,  (4.  26),  and  (4.  24)  in  (4.  2l)  we  get  also 
of  a with  r > 2 and  s > 2. 


Z 


r(r  + l)a  ]} 
n,rJ 


in  terms 


r,  s 

In  view  of  (4.2),  there  are  only  six  distinct  parameters  a 

r,  s 

with  r > 2 and  s > 2.  There  remain  yet  unused  three  of  the  eighteen 
original  equations.  It  would  be  expected  that  they  would  give  three 

more  conditions  among  the  a , but  surprisingly  they  turn  out  to  be 

r,  s 

dependent  on  the  other  fifteen.  This  is  due  to  the  particular  relations 

that  subsist  among  the  A , B , and  C , and  would  not  be  the  case 

m m m 

with  general  A , B , and  C . 

nr  m7  m 

Thus  consider  (4.23)  for  m = 0,  of  which  we  have  not  yet  made 
any  use.  If  we  substitute  from  (4.  23)  into  (4.  2 1 ) , we  will  get 


(4.28) 


l n(n-l)a  + Z (C,  - Bn> 

n = 2 ’ n-1 


-Z  Z 4r-l)a  > 

n-1  r=2 
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Making  use  of  a = a lets  us  write  the  final  term  as 


n,r  r,  n 


n=  2 

By  (4. 15)  and  (4. 17),  we  have 


2 n(n  - 1)  2 an>  r • 


r=  1 


C-fV  2,  <C«  ' V * 1 - <C0  - B0> 

n=  l 


Also,  from  the  given  values  of  the  A^,  we  have 


A_j  s 2 2 n<n  ’ !)AnJ  * 

n~2 


Putting  these  into  (4,  28)  gives 


a-l,0  5 a ^C0  ’ S0  " 2 n^n  " " a-l,n  " 2 an,r^  * 

r « x 

Use  of  (4.  2i)  converts  this  into  (4,  23)  with  m 3 0. 

Consider  next  (4.  24)  for  m ■ 0,  of  which  we  have  not  yet  made 
any  use.  If  we  substitute  from  (4.  24)  into  (4,  2i),  we  will  get 

(4.29)  a 0.A J-|(0  tl  - l »(»+!)..,  + l (C„  + Bn) 

n®2  n~l 

4 4 

"2  2 r(r  + i)an  r}  • 
n*l  r=2 

As  before,  w©  write  the  last  term  as 


4 

l 

ne  2 


'i  4 

2 + 1)  2 ®n  r . 

= 5 r«l  * 
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and  we  have 


C-I  + B-t  + 2 (Cn  t B ) = 1 - (C  + B ) , 
n=  1 

and 

4 

Aj  = 2 (l  - Z n(n  + l)An>  . 

n = 2 

Substituting  these  into  (4.29),  and  using  ( 4 . 2 1 ) , gives  (4.24)  with 

m = 0. 

Consider  finally  (4.24)  with  m = -1.  Refer  back  to  (4.26). 

We  have 


4 4 

E 

n = 2 r=  2 

4 4 

2 n(n  + 1)  Yj 
n=2  r=2 

Also,  use  of  (4. 15)  and  (4.  18)  gives 


‘i 

4 E ”(n-  1)  E r(r  + D»n,r 


= 4 Z n(n  + l)  Z r(r  ~ !)an  r 


i {C1  " B1  " t E n(n  - l)(Cn  + Bn)J 
n=2 


■ 2 (C-1  + B-, 


7 E n<n  + l)(Cn  - Bn)>  . 
n = 2 


Substituting  these  into  (4.26)  and  using  (4.23)  gives  (4.24)  with  m - -1. 
Thus  we  can  choose  a with  r j>  2 and  s > 2 at  will,  subject 


r,  s 


to  a = a . Then  we  can  substitute  gradually  back,  and  recover  all 


r,  s s,r 
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the  a . Recalling  that  we  have  also  the  free  parameters  B . C., 
r,  s 0 O’ 

and  C4,  we  see  that  there  is  a nine  fold  multiplicity  of  solutions. 

One  would  have  thought  it  possible  to  choose  a^  ^ = 0 ^or  n> 

thus  reducing  to  the  case  S = 3.  However,  as  A^  = l/40,  this  is 

4 

precluded  by  (4.  6). 

We  observe  that  if  the  principal  grid  point  is  at  a distance  h from 
the  left  edge,  but  further  than  that  from  the  top  or  bottom,  then  one  can 

i 

use  the  off  center  difference  approximation  in  the  x-direction  only.  We 

use  the  methods  given  above  to  see  if  one  can  get  a stencil  which  does 

not  extend  as  far^s  six  grid  points  in  the  x-direction.  It  turns  out 

that  one  cannot,  but  we  will  present  the  analysis  anyhow,  since  it  shows 

how  to  generate  all  possible  stencils. 

Without  causing  confusion,  we  can  use  the  same  letters  as  before, 

but  with  slightly  altered  denotations. 

So  for  our  a we  will  now  have  -2  < n < 2,  -1  < m < S.  In 
m,n  ~ ~ ’ ~ — 

place  of  (4.2),  we  will  have 

(4. 30)  a = a 

m, n m, -n 

All  summations  on  n should  be  from  -2  to  +2.  Specifically,  this 

change  should  be  made  in  (4.  3),  (4.4),  (4.6),  (4.14),  and(4.i6).  Delete  (4.  5). 

As  before,  we  see  that  we  must  have  S 4.  Taking  S = 4,  we 

get  the  same  values  of  A as  before. 

m 

By  (4.  30)  and  (4. 14),  we  have  = 0 for  all  m.  Thus  (4.  15) 
is  trivially  satisfied. 
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We  get  the  same  determination  as  before  for  the  C . 
Finally,  we  write 


(4.  31) 


D 

m 


l 


n=  -2 


4 

n a 


m,n 


We  must  have 


(4.32) 

D + D 

-1  < 

(4.33) 

-D  + D 

0 1 2 3 4 5 

2D_  + 3D_  + 4D  . = 0 . 
2 3 4 


Given  the  A , B , C , and  D , there  is  no  question  how  to 
m’  nr  m7  m 

determine  the  a . We  have  immediately 
m,  n 


a , = a _ 
m,  -2  m,  2 


D - C 
m m 

24 


4C  - D 
m m 

am,  -1 5  &m,  1 6 ’ 


a = A - 2a  -2a  0 . 

m,  0 m m,  1 m,  2 

Thus  we  can  easily  determine  sets  of  a . There  are  18  distinct 

rn,  n 

a . As  they  do  not  depend  on  the  B , it  apoears  that  we  have  a six 
m,n  nr 

fold  multiplicity  of  solutions.  It  is  surprising  that  this  does  not  permit 

the  choice  a . = a . , = a . which  would  let  us  reduce  S to  3. 

4,  0 4,  1 4,  2’ 


5.  Regions  of  unusual  shape.  We  have  been  using  squares  for  our 
grid.  There  are  cases  where  this  is  really  impractical.  For  example, 

suppose  our  region  is  a rectangle  of  sides  1 and  \Tz.  For  rectangles  of 
intractable  proportions,  a way  of  handling  the  matter  easily  is  provided  in 
Rosser  [ 1 ] . Beyond  that,  we  have  not  pushed  our  investigations. 
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Abstract 


This  paper  is  concerned  with  the  problem  of  evaluating  the 
effectiveness  of  existing  and  conceptual  illuminating  rounds.  The 
illuminating  round  effectiveness  model  developed  at  Picatinny  Arsenal 
essentially  consists  of  three  parts:  a ballistic  portion  which  is 

concerned  with  round  deployment  and  descent,  an  illumination/atmospheric 
portion  which  calculates  target /background  visual  contrasts  at  all 
points  in  search  area  during  the  entire  flare  burn  time,  and  a human 
visual  perception  portion  which  determines  from  the  target /background 
contrast  the  glance  target  detection  and/or  recognition  probabilities. 
Effectiveness  measures  are  evaluated  by  integrating  suitable  functions 
of  the  glance  target  detection/recognition  probabilities  over  the  search 
area  and  flare  burn  time. 

In  particular,  this  paper  will  concentrate  on  developing  the 
illuminating  round  effectiveness  measures  built  into  the  model. 

Basically,  two  different  measures  of  effectiveness  are  employed.  The 
first  measure,  referred  to  as  the  time  integral  of  the  area  illuminated 
(TIAI),  has  the  dimensions  of  area-time  and  is  meaningful  when  it  is 
desired  to  evaluate  the  ability  of  an  illuminating  round  to  maintain 
illumination  over  a given  search  area.  The  second  measure  of  effectiveness, 
referred  to  as  effective  area  (EA),  has  the  dimension  of  area  and  is,  in 
spirit, analogous  to  the  lethal  area  concept  employed  extensively  in 
evaluating  the  effectiveness  of  HE  munitions.  This  effectiveness  measure 
is  particularly  useful  when  it  is  desired  to  evaluate  the  ability  of  an 
illuminating  round  to  enable  detection  or  recognition  of  unknown  targets 
in  a sepcified  search  area. 

An  application  of  the  model  to  an  hypothetical  incremental  flare 
illuminating  round  is  included  to  illustrate  concepts  developed. 

Survey  of  Illumination/Atmospheric  and  Visual  Perception  Sub-Models 

The  scenario  employed  in  the  model  is  illustrated  in  Fig.  1. 

Though  the  figure  depicts, as  an  example,  a ground  launched  round,  the 
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model  can  just  as  readily  consider  air  dropped  flares.  The  model 
assumes  a flat  rectangular  search  area  which  is  to  he  illuminated 
by  the  round.  Inputs  to  the  model  include  parameters  to  describe 
the  characteristics  of  the  flare,  parameters  to  define  the  scenario, 
parameters  to  specify  target /background  characteristics,  and 
parameters  to  characterize  the  atmosphere.  Table  1 gives  a detailed 
list  of  the  inputs  necessary  to  drive  the  model. 

TABLE  1 

INPUTS  TO  THE  MODEL 


Flare  Parameters : 

Candle  Intensity  Vs  Time 

Burn  Rate  Vs  Time 

Pyrotechnic  Weight 

Non -Combustible  Weight 

Initial  Stabilized  Flare  Descent  Rate 

Scenario  Parameters: 

Search  Area  Dimensions  and  Location 

Location  of  Observers 

Number  of  Observers 

Observer  Scan  Rate 

Initial  Flare  Position 

Target /Background  Parameters: 

Target  Size 

Target  Reflectivity 

Background  Reflectivity 

Atmospheric  Parameters: 

Normalized  Volume  Scattering  Function 
Meterological  Range  (Visibility) 

Wind 

As  shown  in  Figure  2,  targets  in  search  area  are  modeled  as 
having  two  reflecting  surfaces;  a horizontal  surface  parallel  to 
the  ground  plane  and  a vertical  surface  oriented  such  that  its 
normal  is  pointed  in  the  direction  of  the  projection  of  the  observer's 
location  into  the  ground  plane.  Target /background  contrasts  and 
detection/recognition  probabilities  are  calculated  independently  for 
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each  surface. 


The  illumination  geometry  is  portrayed  in  Figure  3.  Light 
reaching  the  observer  comes  from  three  sources:  light  reflected 

from  the  target  surface  as  the  result  of  direct  illumination  by 
the  flare,  light  from  the  flare  scattered  into  the  visual  path 
between  the  target  and  the  observer  which  gives  rise  to  path 
luminance,  and  glare  which  essentially  represents  direct  light 
from  the  flare  reaching  the  observers  eye.  The  apparent  target/ 
background  contrast  is  defined  as: 

bT  ' Bb  | 

C = (1) 


apparent  target /background  contrast  as 
seen  by  the  observers. 

apparent  target  brightness  as  seen  by  the 
observers  ( f ootlambert s ) . 

apparent  background  brightness  as  seen  by 
the  observers  (footlamberts  ). 

In  calculating  the  apparent  brightness  of  the  target  and  background 
it  is,  of  course,  necessary  to  account  for  the  light  attenuation 
that  results  from  atmospheric  scattering  (light  absorption  in  the 
visible  spectrum  is  insignificant  and  is  neglected).  It  is  also 
important  to  note  that  the  model  accounts  for  only  single  scattering 
and  neglects  indirect  illumination  and  other  effects  due  to  multiply 
scattered  light. 

From  the  apparent  target/background  contrast  one  computes  the 
target  detection  probability.  This  computation  is  performed  with  the 
aid  of  the  extensive  work  performed  by  Blackwell. 1*2  Using  Blackwell's 
results,  the  detectability  at  time  t of  a target  located  at  (x,  y)  in 
the  search  area  (i.e.,  the  conditional  target  detection  probability 
given  the  nresence  of  a target  and  a observer  looking  at  the  location) 
is  given  by: 


where  C 


Bn 


tc  (x, y, t) /CL (x , y)-l ] / . 39 


- Oo 


where  C (x,  y,  t)  = the  actual  apparent  target /background  contrast  at 

time  t for  a target  located  at  (x,  y). 
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C (x,  y)  = the  liminal  target /background  contrast, 

(i.e.,  contrast  required  for  50%  target 
detection  probability)  for  a target  located 
at  (x,  y). 

In  general,  the  liminal  target /background  contrast  is  a function 
of  the  visual  angle  subtended  by  the  target  to  the  source,  the  back- 
ground brightness  which  determines  the  visual  adaptation  level,  and 
the  time  duration  that  the  location  is  examined.  Fig.  4 gives  the 
liminal  target /background  contrast  used  in  the  study.  It  is  based 
on  a 1/3  second  observer  glance  duration. 

The  probability  of  target  recognition  given  detection,  Pr, 
depends  upon  the  additional  ability  of  the  observer  to  resolve  indi- 
vidual portions  of  the  target  so  that  its  shape  can  be  ascertained. 

For  target  detection,  all  that  is  required  is  for  the  observer  to  be 
able  to  identify  an  anomaly  in  the  terrain  features.  The  expression 
used  for  the  conditional  probability  of  recognition  given  target 
detection  is  taken  from  the  MARSAM  11^  study: 

{(N(x,y,t)-3.2)2 

1-exp N>3.2 

11  (3) 

0 N < 3. 2 

where  N (x,y,t)  denotes  the  number  of  resolvable  elements  across 
the  target  surface.  N(x,y,t)  depends  upon  the  angle  subtended  by  the 
target  to  the  observer  and  visual  adaptation  level.  The  product 
P(x,y,t)*Pr  (x,y,t)  gives  the  unconditional  glance  probability  of 
target  recognition. 

Target  detection  and/or  recognition  glance  probabilities  are 
computed  independently  for  both  the  horizontal  and  vertical  target 
surfaces.  Depending  upon  the  intended  application, either  the  horizontal 
surface  detection  or  recognition,  vertical  surface  detection  or  recog- 
nition or  maximum  detection  or  maximum  recognition  probabilities  can  be 
used  in  computing  the  illuminating  round  effectiveness  measures.  For 
simplicity,  in  the  derivations  that  follow  we  will  use  one  of  the  glance 
target  detection  probabilities,  P(x,y,t). 

Measures  of  Effectiveness 


The  first  effectiveness  measure  to  be  considered  is  the  time 
integral  of  the  area  illuminated,  TIAI.  At  a given  time,  t,  during 
the  flare  burn,  one  can  weight  each  differential  area,  dA,  in  the  search 
region  by  the  target  detectability,  P(x,y,t),  and  integrate  over  the 
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entire  search  area  to  obtain  the  area  illuminated  at  time  t,  AL(t). 


AL(t) 


/ 

Search 

Area 


P(x,y,t)dA 


(M 


The  significance  of  Eq.  becomes  apparent  if  one  envisions  the 
search  area  to  be  populated  with  targets  of  uniform  area  density  p 
targets/m2.  Thus  p P(x,y,t)dA  is  the  expected  number  of  detectable 
targets  in  dA  and  hence,  the  expected  number  of  detectable  targets 
in  the  search  area  at  time  t is  given  by  p AL(t). 


Integrating  Eq.  k over  the  burn  time  of  the  flare  gives  rise 
to  the  time  integral  of  the  area  illuminated. 


TIAI  = 


/{  / 

n 


P(x,y,t)dA 


Burn 

Time 


Search 

Area 


} 


dt 


(5) 


The  maximum  possible  value  for  TIAI  is  the  product  of  the  search 
area  and  the  flare  burn  time.  Normalizing  by  this  quantity  yields 
an  effectiveness  measure  whose  value  lies  between  0 and  1 and  repre- 
sents the  average ,over  search  area  and  flare  burn  time,  of  the  target 
detectability.  This  figure  of  merit  is  most  useful  in  evaluating 
the  ability  of  a round  to  maintain  adequate  illumination  during  the 
burn  over  the  entire  search  area. 


Average  detectability  = 


TIAI 

AT 


(6) 


In  order  to  develop  the  notion  of  the  Effective  Area,  EA,  of  an 
illuminating  round  in  a given  scenario,  the  concent  of  the  observer 
scan  rate  must  be  introduced.  The  scan  rate  denoted  by  a , is  simply 
the  area  scanned  by  each  observer  per  unit  time.  We  assume  that  the 
scanning  procedure  is  random  and  that  the  observers  are  independent. 
Hence  in  differential  time,  At,  the  probability  that  a given  point 
(x,y)  in  the  target  area  is  examined  is  given  by: 


Prob.  (glimpse  at  (x,y)  in  At)  = 


(7) 


The  probability  of  detecting  a target  at  (x,y)  not  previously  detected 
in  the  time  interval  t to  t + At  is  thus  given  by: 


Prob 


Target  at  (x,y) 
detected  between , 
t ,t+At 


Not  detected 
up  to  time  t 


= P(x,y,t) 


naAt 

A 


(8) 


339 


Let  K(x,y,t)  denote  the  probability  of  detecting  a target  located 
at  coordinates  (x,y)  in  the  time  interval  0 to  t,  and  let  K (x,y,t) 

= 1 -K(x,y,t)  denote  the  probability  of  not  detecting  the  target  _ 
by  time  t.  We  then  obtain  the  following  difference  equation  for  K (x,v,t) 


K(x,y  t+At)  = K(x,y,t)  [ 1-P  (x,y,t)  ngAt  ] 

A 


(9) 


This  difference  equation  results  from  the  fact  that  for 
to  be  detected  by  t + At,  it  must  not  be  detected  up  to 
detected  in  the  interval  t to  t + At.  Taking  the  limit 
one  obtains  the  differential  equation; 


a target  not 
t a^id  not 
as  'At  0, 


d _ 

— K(x,y,t)  = -P  (x,y,t)  22. 
dt  A 


K(x,y,t) 


(10) 


The  solution  to  this  equation  subject  to  the  initial  condition  that 
at  t = 0,  K(x,y,t)  = 1 is  given  by: 


na 


K(x,y,t)  = e 


f 3 (x » y > 

n 


n)dq 


(li) 


Hence  the  probability  of  detecting  a target  located  at  coordinates 
(x,y)  over  the  total  flare  burn  time  T is  given  by 


K(x,y,T) 


1 


e 


na 


A 


T 

P(x,y, t)dt 


(12) 


Finally,  the  probability  of  detecting  a target  in  dA  over  the  flare 
burn  is  given  by  the  product  of  the  probability  that  there  is  a 
target  in  dA  and  the  conditional  detection  probability  given  the 
target  is  in  dA. 


K(x,y,T)  pdA 

The  expected  number  of  targets  detected  is  thus  given  by 
Expected  number  of  targets  detected 


(13) 


= P f [l-i 

J A 


na 

A ^ o 


f T 

/ P(x,y,t)dt 


] dA 


(lb) 


The  actual  number  of  target  detection  would  be  Poisson  with  the 
above  expectation. 
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As  an  effectiveness  measure  one  can  thus  take  the  expression 


f T 

EA  = f [1 -e~na^XJ  P ( x » T » t ) d t j 

■'a  ° 


dA 


(15) 


The  above  quantity  referred  to  as  the  Effective  Area,  has  the  dimension 
of  area.  Its  value  is  between  0 and  the  size  of  the  area  being 
illuminated.  One  can  define  a flare  efficiency  by  considering  the 
ratio  of  EA  to  either  the  total  search  area  or  the  expected  area 
scanned  by  the  observers  during  the  search.  The  first  normalization 
yields:  T 

EA  /a  U-e-n“''A/P(x->r-t)‘JtldA 

- — ” (ns) 


A 


A 


For  a uniform  target  area  density,  this  quantity  can  be  inter- 
preted as  the  ratio  of  the  expected  number  of  targets  present  in 
the  search  area. 


An  alternate  interpretation  is  possible  for  the  case  where  it  is 
desired  to  find  a single  target  located  in  the  search  area . The 
conditional  probability  that  the  target  is  detected  given  that  it  is 
located  at  (x,y)  is  given  from  Eq.  12  as: 


{Prob.  of  detecting 
target  at  (x,y) 


1 


e 


T 

(x , y , n) dn 


(17) 


The  probability  that  the  target  is  in  dA  is,  assuming  all  locations 
are  equally  likely,  clearly  given  by  dA/A.  Hence  integrating  over 
the  search  area  demonstrates  that  EA/A  also  represents  the  probability 
of  finding  a single  target  located  randomly  in  the  search  area. 


A potential  difficulty  with  Eq.  l6  is  that  it  represents  the 
combined  capabilities  of  the  observers  as  well  as  the  flare.  From 
Eq.  15,  one  can  obtain  the  expected  total  area  scanned  by  all  the 
observers  by  setting  P(x,y,t)  equal  to  1 

naT 

Expected  Area  Scanned  = A[l-e  ^ ] (l8) 


If  the  expected  total  scanned  area  is  small,  EA/A  will  be  small  no 
matter  how  good  the  actual  flare  performance . To  account  for  the 
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possible  limitation  of  the  observers  to  scan  the  entire  search  area, 
one  can  evaluate,  for  the  single  target  case,  the  conditional 
probability  that  the  target  is  detected  given  that  it  is  scanned. 
This  probability  is  given  by: 

P (target  detected/target  scanned) 

_ P (target  detected) 

P (target  scanned) 


The  probability  that  the  target  is  detected  is  given  by  EA/A,  and  the 
probability  that  the  target  is  scanned  is  obtained  from  Eq . 11  by 
setting  P(x,y,t)  equal  to  1.  Performing  these  substitutions  we  obtain: 


P (target  detection/target  scanned) 


(20) 


EA 

nctT 

Ad  -.•"**> 


= Effective  Area 

Expected  Area  Scanned 


The  effectiveness  expressions  obtained  in  this  section  are  illustrated 
below  with  the  aid  of  an  example. 


Example 

Consider  a Hypothetical  Incremental  Flare  whose  candle  power  and 
descent  altitude-time  profile  is  shown  in  Fig.  5.  The  scenario  consists 
of  a search  area  which  is  a square  3000  feet  on  each  side  and  a single 
observer  who  can  scan  10^  ft^/sec.,  located  at  an  altitude  of  200  feet, 
6500  feet  back  from  the  center  of  the  search  area.  The  reflectivity  of 
the  target  is  O.lU  and  the  background  is  0.10.  The  target  diameter  is 
15  feet.  The  flare  is  dropped  directly  over  the  center  of  the  search 
area,  wind  drift  is  assumed  to  be  zero,  and  the  atmospheric  visibility 
is  15  miles. 


Figure  6 shows  the  glance  target  detection  probability  that  exists 
30  seconds  after  flare  initiation.  Notice  the  rapid  fall  off  in  glance 
detection  probability  on  the  front  lighted  portions  of  the  search  area. 
Figure  7 depicts  the  area  illuminated,  Ea . U,  as  a function  of  flare 
burn  time,  and  Figure  8 depicts  the  effectiveness  measures  as  a function 
of  flare  burn  time.  Curve  A indicates  that  the  maximum  average  target 
detectability  occurs  early  in  the  flare  burn.  It  may  be  desirable  to 
design  the  candle  to  provide  a high  target  detectability  at  the  beginning 
of  its  burn  before  countermeasure  actions  can  be  taken  by  the  enemy. 

A flat  curve  would  indicate  a flare  design  which  provides  constant 
average  target  detectability  during  its  entire  burn.  Curves  B and  C show 
the  probabilities  of  single  target  detection  up  to  a given  time.  The 
two  curves  converge  since,  for  the  parameters  used,  the  probability  of 
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scanning  the  entire  search  area  during  the  entire  flare  burn  is 
virtually  1, 

Conclusions 


The  model  described  in  this  paper  is  a general  purpose  system 
model  developed  at  Picatinny  Arsenal  for  the  purpose  of  evaluating 
illuminating  round  effectiveness.  The  model  has  already  been 
successfully  applied  to  answer  questions  regarding  fuze  accuracy 
requirements.  The  model  should  be  particularly  applicable  in 
evaluating  advanced  illuminating  round  design  concepts  such  as  the 
incremental  flare.  In  fact  the  model  can  be  used  to  evaluate  the 
relative  importance  of  each  of  the  design  parameters  and  thus  enable 
intelligent  decisions  regarding  the  allocation  of  development  funds. 

The  model  can  also  be  applied  to  questions  of  doctrine,  tactics  and 
the  development  of  firing  tables. 

Comprehensive  documentation  of  the  model,  including  the  computer 
code,  is  given  in  Reference  l*. 
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LAUNCHING  OF  ELECTROMAGNETIC  SURFACE  WAVES  AT  A PLANAR  METAL-AIR  INTERFACE 


J.  M.  Zavada  and  E.  L-  Church 
Pitman* Dunn  Laboratory 
Frankford  Arsenal 
Philadelphia,  PA  19137 


ABSTRACT.  We  consider  the  two-dimensional  problem  of  a line  source  located 
in  air  above  a homogeneous  metal  half  space.  The  incident  field  is  represented 
as  an  angular  spectrum  of  plane  waves,  and  integral  expressions  for  the  reflected 
and  transmitted  fields  are  obtained  by  satisfying  the  usual  electromagnetic 
boundary  conditions  at  the  interface  for  each  plane  wave*  The  properties  of  these 
fields  are  examined  in  the  far-field  region  by  evaluating  the  integrals  by  the 
method  of  steepest  descents,  including  the  effects  of  branch  cuts  and  poles.  This 
method  indicates  that  for  certain  values  of  the  complex  permittivity  of  the  metal, 
slow  Zenneck  surface  waves  are  excited.  These  waves  decay  exponentially  away  from 
the  interface,  propagate  along  the  surface  with  a phase  velocity  less  than  that  of 
light,  and  have  a transverse  attenuation  length  determined  by  the  conductivity  of 
the  surface.  An  expression  is  derived  for  the  power  carried  off  by  these  surface 
waves , 


1.  INTRODUCTION.  In  recent  years  there  has  been  considerable  interest  in 
the  generation  and  propagation  of  electromagnetic  surface  waves  at  optical  and 
infrared  frequencies.  These  waves  are  characterized  by  an  exponential  dbcay  away 
from  the  guiding  surface  and  have  been  excited  on  a number  of  open-boundhry  struc- 
tures. 1 In  particular,  these  surface  waves  have  been  generated  directly  on  corru- 
gated or  dielectric  coated  planes;  and  indirectly,  by  means  of  a prism  coupler, 

on  smooth,  uncoated  plane  surfaces.^  The  main  interest  in  these  waves  lies  in 
their  potential  application  as  surface  probes^  and  in  various  electronic  devices.^ 

These  surface  waves  may  also  play  a significant  role  in  the  increased  ab- 
sorption of  mirrors  due  to  surface  microroughness . ^ To  determine  their  actual 
importance,  it  is  necessary  to  calculate  the  amount  of  energy  that  is  coupled 
into  these  waves  by  surface  corrugations  or  roughness.  Our  approach  to  this 
problem  has  been  based  on  a classical  EM  scattering  formalism  which  was  develop- 
ed for  radar  scattering,  augmented  with  a perturbation  expansion  appropriate  to 
small  scales  of  roughness. ^ 

In  our  investigation  of  the  effects  of  surface  roughness,  we  have  found 
that  electromagnetic  surface  waves  can  be  excited  on  a smooth,  uncoated  plane 
surface  without  the  use  of  a prism  coupler, ^ The  present  paper  demonstrates 
this  result  and  specifies  the  conditions  which  are  necessary  for  the  launching 
of  these  waves.  This  paper  also  outlines  the  general  formalism  used  in  our 
continuing  studies  of  slightly  rough  surfaces. 

2 , PLANE  WAVE  REPRESENTATION  OF  FIELDS » We  treat  the  two-dimensional  prob- 
lem of  a line  source  situated  in  air  a distance  h above  a homogeneous  half  plane 
(Fig.  1).  The  permeability  of  the  entire  plane  is  that  of  the  vacuum,  [iQ.  The 
permittivity  of  air  is  eQ  (the  free  space  value)  and  that  of  the  medium  is 
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s = (e.  - ieg)  e0  = eeQ.  We  allow  e to  be  a complex  quantity  to  allow  for  possible 

losses  in  the  medium.  The  line  source  has  a time  dependence  e1U}t  (which  will  be 
suppressed  in  the  following)  and  is  specified  by: 


(1) 


Hz1(x,y)  = f~  exp  (-  531)  Hq  \koir') 


= 0 


where  (x,y)  are  the  coordinates  of  the  observation  point;  r;  is  the  distance  from 
the  source;  kQ  = 2tt /X  and  Hq  * ^ (..)  is  the  zeroth  order  Hankel  function  of  the 
second  kind.  We  have  chosen  a p-polarized  line  source  since  only  this  polariza- 
tion will  lead  to  surface-wave  excitation  in  this  particular  problem. 


The  first  step  in  our  calculation  is  to  represent  the  incident  fields  in 
terms  of  a spectrum  of  plane  waves. ® This  procedure  is  closely  related  to  the 
Fourier  expansion  of  the  fields  and  originates  in  the  work  of  Sommerfeld,  Debye 
and  Weyl,  In  fact,  one  of  the  standard  integral  representations  of  the  Hankel 
function  is  in  the  form  of  a plane  wave  expansion: 


(2) 


(2) 

H0  <x,y) 


exp(+  i kQ(x  cos  a-  + y sin  a))  dev 


where  the  sign  convention  (-  for  y > 0,  + for  y < 0)  is  chosen  to  guarantee  con- 
vergence of  the  integral.  In  Eqn.  (2),  a is  a complex  angle  and  C is  the  path 
of  integration  in  the  a.  plane  (Fig.  2).  Then,  selecting  the  representation 
which  is  valid  fpr  the  half-space  below  the  source,  we  obtain  the  plane  wave 
expansion  for  H ZL; 


(3)  Hz1(x,y)  = ^ J*  P(cO  exp (i  kQ  (x  cos  a + y sin  a) ) dev 

where  K - J%  exp  (-  ^H)  and  P(a)  “ exp(-  i kQ  h sin  cy).  Here  p(a)  is  the  spectral 
density  of  the  field^and  the  exponential  factor  is  a plane  wave.  Since  a can 
be  a complex  quantity,  we  have  two  types  of  plane  waves  in  the  spectrum.  When 
Qi  is  real,  we  have  the  usual  homogeneous  plane  wave  of  constant  amplitude.  When 
a is  a complex  angle,  we  have  an  inhomogeneous  wave,  i.e.,  the  planes  of  constant 
phase  no  longer  coincide  with  the  planes  of  constant  amplitude. 

In  order  to  obtain  the  reflected  and  transmitted  fields,  we  satisfy  the 
usual  EM  boundary  conditions  at  the  surface  for  each  plane  wave  of  the  spectrum. 
This  leads  to  the  following  representation  of  the  reflected  field  H2 : 
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(4) 


h*  (x>y) 


p„(a)  P(a)  exp(i  kn(x  cos  a - y sin  a))  da 
H u 


PH(a) 


e sin  g - v-'e  - cos^  a 
e s in  a + */e  - cos'  a 


where  p (cy)  is  the  Fresnel  reflection  coefficient  for  a p-polarized  plane  wave. 
Similarly,  the  transmitted  field  Hz  is  represented  by; 


(5) 


aj*  K f * i ^ 

H (x,y)  = - t (a)  P(a)  exp(i  k (x  cos  a + yy'e  - cossa))dcki 
TT  u c n.  u 


TH(Qf)  ^ 


where  th(qO  is  the  Fresnel  transmission  coefficient  for  p-polarization, 

3.  SADDLE  POINT  INTEGRATION.  Since  the  integral  expressions  for  and 
cannot  be  solved  in  closed  form,  we  investigate  the  properties  of  these  fields 
by  the  method  of  steepest  descents*  In  this  method,  the  original  path  of  integra- 
tion in  the  a plane  is  deformed  into  the  path  of  steepest  descents , ^ and  then 
an  asymptotic  expansion  is  performed  to  obtain  the  far-field  behavior  of  the 
fields.  However,  if  there  are  any  singularities  in  the  integrand  (poles  or 
branch  points),  Cauchy fs  theorem  requires  that  we  include  their  contribution  to 
the  integral.  Deforming  the  contour  C into  the  path  of  steepest  descents  S, 
then  leads  to  the  general  result ; 


(6) 


4-  2mi  E Res , 


In  Eqn.  (6)  f represents  the  integration  along  the  branch  cuts  which  are 
crossed  and  fC*  E Res,  is  the  contribution  from  the  residues  of  the  integrand 
at  the  poles  which  are  encountered.  The  integral  along  S,  J , yields  the 
radiation  field  and  leads  to  lateral  waves.  The  residues  represent 

the  surface  waves  which  are  excited  .Therefore , the  saddle  point  integration  gives 
us  a means  of  distinguishing  the  surface  waves  in  the  resulting  fields.  Also 
we  notice  that  the  fields  will  contain  surface  waves  only  if  the  integrand  has 
a pole  which  is  encountered  as  the  contour  is  deformed. 

First,  we  apply  this  method  to  the  reflected  field  Hz . It  is  convenient 
at  this  stage  to  introduce  polar  coordinates  (R,  i ) defined  as  in  Fig.  1; 


(7) 


R cos  i = x 
R sin  | = y + h 


Then  Eqn.  (3)  can  be  rewritten: 

(8)  Hz  = n !"cpH^a^  exP(i  k0  R cos(V  +a))da 
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The  saddle  point  is  easily  found  to  be  c*s  = tt  - f and  the  curve  of  steepest 
descent  passing  through  arg  is  defined: 

cos  (a  - a ) cosh  u = 1 
(9) 

sin  (a  - arg  ■ tanh  p 

where  o - Re  a and  p = Ira  a.  This  curve  is  denoted  by  S(a„)  and  is  shown  in 
Fig.  3. 

12 

The  only  poles  in  the  integrand  of  Eqn.  (8)  are  those  of  p^(a).  From 
Eqn.  (4),  we  find  that  these  poles  are  defined  by;  r 


(10) 


cos  QTp  - + j e 

/ 1 + e 


sin  cv 

P 


1 


Two  of  these  poles  are  relevant  to  the  present  discussion.  If  we  let  Jo] 1 + e = 
A - iB  and  1/yi  + e = Dj  + iD^ , then  these  two  poles  aQ  and  are  given  by: 


(11) 


Qt  = TT  + CT  + i -n 
17  P p 


tan  a 

P 


D 


1 


A 


tanh  i? 

P 


D 


2 


A 


The  approximate  location  of  these  poles  in  the  a plane  is  shown  in  Fig*  3* 

The  coefficient  p (cv ) also  has  branch  point  singularities  which  are 
defined: 

(12)  cos  orfi  = + Je 

These  branch  poinus  and  the  appropriate  branch  cuts  are  also  displayed  in  Fig,  3. 

Referring  to  Fig,  3,  we  have  the  following  situation:  The  original  contour 

C is  deformed  into  the  path  of  steepest  descents  S (TT  - \jj)  which  passes  through 
the  saddle  point  c^-  As  we  vary  our  point  of  observation  (ty  goes  from  0 to  tt), 
we  sweep  out  the  region  shaded  in  the  figure.  In  doing  so,  we  pass  over  branch 
points  and  this  integral  along  the  branch  cuts  contributes  lateral  waves  to  the 
field*  Also,  depending  on  the  material  properties  of  the  medium  (the  value  of  e), 
we  may  encounter  a pole  which  will  give  rise  to  a surface  wave. 
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From  Eqns.  (9)  and  (10),  it  can  be  shown  that  if  A,  the  real  part  of 
JUl  + e,  is  greater  than  unity,  then  the  poles  aQ  and  will  lie  in  the 
shaded  region  and  a surface  wave  will  be  excited.  In  this  case  there  is  a 


critical  angle  i|fc  which  delineates  the  regions  in  which  the  surface  wave  will 
appear  in  the  reflected  field.  For  an  angle  satisfying  either  ijf  < i|f  or 
tt  - i(i  < i|fc,  a pole  will  be  encountered  in  the  deformation  of  the  contour  C. 
Physically,  this  means  that  the  surface  wave  is  important  in  determining  the 
far-field  expansion  of  only  in  regions  near  the  surface^  (Fig.  4). 


Since  the  value  of  Re  Jzl 1 + e is  essential  in  determining  the  existence 
of  surface  waves  in  this  problem,  we  restate  this  condition  in  terms  of  e^,  and 
62-  By  solving  the  equation: 


we  obtain  the  curve  in  the  complex  e plane  shown  in  Fig.  5.  The  shaded  region 
corresponds  to  values  of  A > 1.  Then,  if  the  material  has  dielectric  response 
such  that  (ej_,  £2)  is  a point  in  the  shaded  region,  surface  waves  will  appear 
in  the  reflected  field. 


The  transmitted  field  is  treated  in  much  the  same  way.  However,  since  the 
integral  contains  the  factor  exp  (i  kQ  y */e  - cos ) , the  saddle  point  integra- 
tion of  this  field  is  more  complicated.  Nevertheless,  the  decomposition  of  Hz 
into  a radiation  field,  a lateral  wave  and  a surface  wave  remains  valid.  The 
surface  wave  again  results  from  the  residue  of  the  integral  at  either  or  o^.. 
Also,  the  conditions  for  the  appearance  of  the  surface  wave  in  Hz  are  the  same 
as  those  that  apply  for  the  reflected  field. 

4.  SURFACE  WAVES.  We  shall  now  examine  the  surface  waves  that  are  excited 
along  the  positive  x axis  (\jr  < ^rc)  when  Re  e / ( 1 + e)  >1.  By  calculating  the 
residues  of  the  reflected  and  transmitted  fields  at  0^  “ n + a + i up , we 
obtain:  ' 

H'  (x,y)  = H0  exp(-i  kQ U e ~ x - yA/1  + e) ) x > 0,  y > 0 

1+e 

(14)  H " (x,y)  = H exp (-i  k (J  e x - ~^==))  x > 0,  y < 0 

1+e  vl  + e 
3/2 

H0  = 2fj2rri  2 ' exp(i  kQ  h/Jl  + e) 


The  surface  wave  in  Eqn.  (14)  belongs  to  the  general  class  of  Zenneck  surface^ 
waves  which  are  defined  by  the  vanishing  of  the  reflection  coefficient  p^(a). 

The  field  represents  an  inhomogeneous  plane  wave  incident  on  the  surface  at 
a complex  angle  + i u - At  this  angle,  there  is  no  reflected  wave  only 

the  transmitted  inhomogeneous ^plane  wave  Hg-  It  is  impossible  to  excite  these 
surface  waves  on  a smooth,  uncoated  surface  with  incident  homogeneous  plane  waves. 
However,  the  plane  wave  spectrum  of  the  line  source  contains  both  homogeneous 
and  inhomogeneous  waves,  and  it  is  the  latter  which  lead  to  the  surface  wave  in 
Eqn,  (14).  5 
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In  general,  the  fields  e'  and  H'z  will  exhibit  exponential  attenuation  in 
both  the  vertical  and  horizontal  directions.  The  horizontal  attenuation  length 
is  the  same  for  either  field  and  is  given  by: 


(15) 


1 

kQ  B 


B - - Im  v;c/l  + e 


When  the  medium  is  lossless  (02  = 0),  becomes  infinite  and  there  is  no  hori- 
zontal attenuation.  For  the  field  above  the  surface,  U',  the  vertical  attenua- 
tion length  is: 

V 


(16) 

D2  = Im(lA/l  + e) 


and  below  the  surface  the  attenuation  length  is : 


(17)  

G = - Im(e/yi  + G) 


However,  even  for  a lossless  material,  both  i'  and  JL"  remain  finite,  i.e. 

7 p V)  7 

the  wave  is  still  bound  to  the  surface. 

The  phase  velocity  of  the  surface  wave  in  the  x direction  is  the  same  on 
either  side  of  the  interface  and  is  given  by: 


x _ “L  _ £ 
% “ *Sc  “ A 


(18) 


A = Re  / e 


1 + 


Since  A > 1,  the  phase  velocity  is  less  than  c and  the  surface  waves  are  slow 
waves  f 

The  time  averaged  power  flux  < S > is  easily  calculated  from  the  following 
equation : 

_ _ _ 'k 

(19)  <S>=%Re[ExH  ] 

For  the  field  above  the  surface  (x  > 0,  y > 0),  we  find: 
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< s >'  = % JTq  Re  ' r~r~ , - -=  , 0]  I u'  f 
e“  L/  1 + e V1  + e J 


I kJ 


f - 2x  _ 2 (y  + h 
exp  (j-jj  — 


Below  the  interface  (x  > 0,  y < 0)  the  power  flux  is: 


< S > 


= k M TTTf 


e + e’ 


rn  ’ °]  I H* 


exP  l "J"  + 

V "T1 


-2x  , 2y  2h  'N 

■X  + i"  ■ T V 

n V V 


Several  comments  need  to  be  made  concerning  Eqns.  (2G)_and  (21),  _The  first 
concerns  the  factor  exp  (-2h/j£")  which  is  present  in  both  < S > and  < S >u  * 

This  is  the  "height-loss"  factor , well-known  in  antenna  studies,  and  indicates 
that  the  power  coupled  into  the  surface  wave  is  a maximum  when  the  source  is  on 
the  interface  (h  = 0),  As  the  source  is  moved  away  from  the  surface,  the  power^ 
in  the  surface  wave  will  decrease  exponentially.^  Secondly,  the  power  flux  < S >* 
is  at  an  angle  - a with  respect  to  the  positive  x axis.  In  general,  power  flows 
into  the  surface  and  accounts  for  ohmic  losses.  For  a lossles  medium,  a ~ 0^ 
and  the  power  flow  is  parallel  to  the  interface.  Similarly  the  power  flux  < S >t! 
is  directed  into  the  medium  in  the  lossy  case*  However,  when  e2  - 0,<  S >n  is 
parallel  to  the  interface  but  opposite  to  < S >*  ^ m 

Now  we  consider  the  power  flux  per  unit  length  in  the  z direction  across  the 
plane  x = xQ,  This  quantity  is  denoted  by  Px(xQ)  and  is  defined  as 

OJ  o 

(22)  Px(x0)  = J*  < S >*•  x dy  + J < S >"*  x dy 


Using  Eqns.  (20)  and  (21)  for  the  power  flux,  we  obtain: 


*3  ^2  2 2”5f 

px<*0)  = ^y|-i 


exp  ( - T)  Lf, 


; Re /III 
0 J 1 + < 


+ f Re  -7= 


e + e~-JJ 


Then  the  power  coupled  into  the  surface  wave  P is  given  by:' 
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Ps  = 2 Px(0) 


(24) 


f 


[ 


Re 


1 + G 


Q” 


Re 


ynrr 


For  large  values  of  | e | , the  power  carried  off  by  the  surface  wave  is  predomi- 
nantly due  to  the  field  above  the  interface  and  is  proportional  to  j e Also, 

'’com  the  factor,  exp("2h/^)>  this  power  will  be  appreciable  only  if  h - j^, 
i.e.,  the  line  source  must  be  located  within  the  vertical  attenuation  length  of 
the  resulting  field. 

5,  CONCLUDING  REMARKS.  We  have  shown  that  with  a p-polarized  source  it  is 
possible  to  excite  surface  waves  directly  on  a homogeneous,  non-magnetic  half- 
space provided  that  the  dielectric  response  of  the  medium  is  such  that 
Re  J e / ( 1 + e)  > 1.  For  metals,  this  implies  that  the  frequency  of  the  source 
must  be  below  the  plasma  frequency  of  the  metal.  The  resulting  surface  waves 
are  Zenneck  waves  and  exhibit  exponential  attenuation  away  from  the  interface. 

We  have  also  derived  explicit  expressions  for  the  attenuation  lengths  and  for 
the  Poynting  vector  of  these  fields. 

The  duality  principle  implies  that  similarly,  it  is  possible  to  excite 
surface  waves  on  the  planar  interface  of  media  having  different  magnetic  per- 
meabilities using  an  s-polarized  source.  Our  results  can  be  restated  to  apply 
to  this  case  by  simply  substituting  E B,  H -*  -E,  and  jj.  e in  the  above 
formulae. 

The  present  investigation  represents  one  facet  of  our  continuing  study 
of  the  electromagnetic  (optical,  infrared)  properties  of  surfaces.  These  results 
form  an  intermediate  step  in  the  calculation  of  the  launching  efficiency  of 
electromagnetic  surface  waves  on  a corrugated  or  randomly  rough  surface.  Such 
calculations  would  apply  to  the  coupling  of  electromagnetic  radiation  into 
surface  structures  and  relates  to  problems  connected  with  laser  mirrors,  surface- 
wave  devices,  and  integrated  optics. 
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COMPARISON  OF  PERTURBATION-THEORETIC  AND  EXACT  CALCULATIONS 
OF  NONLINEAR  OPTICAL  PROPERTIES  OF  OPTOELECTRONIC  MATERIALS 

+ 

S,  S.  Mitra,*  L.  M.  Narducci,  and  R.  A.  Shatas 
Quantum  Physics,  Physical  Sciences  Directorate,  US  Army  Missile  Command 

Redstone  Arsenal,  AL  35809 


ABSTRACT.  Nonlinear  absorption  coefficients  due  to  multiphoton 
transitions  induced  by  intense  radiation  fields  have  been  calculated 
for  direct  bandgap  semiconductors  at  0.964,  1.06,  1.318,  and  10,6  |j,m 
laser  wavelengths  and  compared  with  experimental  results.  Second-order 
perturbation  models  of  Braunstein  (interconduction  band  transitions)  and 
Basov  (intravalence  or  intraconduction  band  transitions)  in  their  original 
form  yield  underestimates  and  slight  overestimates  of  the  nonlinear 
absorption,  respectively.  Corrections  to  these  models  based  on  a detailed 
consideration  of  effective  band  mass  and  interaction  parameters  are  pre- 
sented which  decrease  the  discrepancy  between  the  theory  and  the  experi- 
ment. The  Keldysh  model  gives  second-order  absorbances  that  are  in 
between  both  perturbation  calculations;  also,  it  predicts  very  well  the 
one-photon  band-edge  absorbance  in  GaAs  and  InSb.  All  models  predict 
only  the  correct  order  of  nonlinear  absorption  coefficient  (typical  a 2 48 
10-°...10~8  Cm  w"1).  Inclusion  of  exciton  bands  is  needed  for  a close 
agreement  between  theory  and  experiment.  This  suggests  that  the  incor- 
poration of  details  of  energy  bands  in  solids  in  semiclassical  nonlinear 
optoelectronic  calculations  is  more  important  than  a full  accounting  of 
quantum  electrodynamical  corrections. 

1.  INTRODUCTION . Under  illumination  by  intense  coherent  infrared 
radiation  sources  presently  available,  many  optoelectronic  materials  are 
driven  into  a nonlinear  response  region.  In  this  paper,  we  analyze  pro- 
cesses contributing  to  the  nonlinear  absorption  of  light  by  electronic 
transitions  taking  place  as  a result  of  a multiquantal  process  in  a dielec- 
tric or  a semiconductor  material  usually  transparent  to  low  intensity 
radiation.  We  also  show  that  the  absorption  at  high  radiation  fluxes  is 
describable  by  a nonlinear  differential  equation.  For  the  case  of  a weak 
radiation  field,  this  equation  can  be  linearized  and  its  solution  yields 
the  Lambert-Beer  law.  At  high  intensities,  however,  nonlinear  terms 
dominate,  and  consequently,  a general  analytic  solution  is  not  available. 
Physical  arguments  must  be  invoked  to  separate  terms  in  powers  of  intensity. 
For  each  particular  ratio  of  materials  properties  to  radiation  flux  param- 
eters, one  particular  term  dominates  the  absorption.  Electronic  transition 
rates  induced  by  photon  processes  of  appropriate  multiplicity  enter  as 
coefficients  in  the  nonlinear  differential  equation  describing  the  absorp- 
tion. Their  calculation  can  be  undertaken  in  the  framework  of  the  time- 
dependent  perturbation  theory  of  wave  mechanics;  also,  a semiphenomenologi- 
cap  approach  employing  the  rudiments  of  the  "dressed  state"  concept  of  the 
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quantum  field  theory  is  parametrized  in  terms  of  the  one-electron  effec- 
tive mass  approximation*  In  the  perturbation-theoretic  approach,  a semi- 
classical  treatment  of  the  interaction  operator  is  undertaken  in  which 
the  vector  potential  of  the  Maxwell  field  is  used  to  describe  the  optical 
radiation,  and  the  oscillating  electron  in  the  momentum  representation 
accounts  for  the  electronic  motion.  The  description  is  complicated  by 
the  need  to  employ  the  effective  mass  parametrization  of  the  energy  band 
theory  of  the  solid  state.  Within  the  limits  imposed  by  these  fundamental 
difficulties,  numerical  methods  were  developed  to  calculate  multiphoton 
absorption  for  a number  of  technologically  important  semiconductors. 
Because  of  a rather  unsatisfactory  state  of  the  fundamental  knowledge  in 
the  nonlinear  quantum  optics  from  both  the  physical  and  the  mathematical 
points  of  view,  a deeper  insight  through  the  employ  of  methods  leaning 
towards  the  axiomatic  quantum  field  theory  was  sought.  One  significant 
aspect  of  this  approach  is  that  the  dynamic  eigenlevel  shifts  of  the  origi- 
nally quasi-s tationary  states  caused  by  the  turn-on  of  the  radiation  field 
can  be  incorporated  at  the  beginning  of  calculations.  This  fast-varying 
modulation  of  the  eigenfunction  of  a discrete  state  of  the  matter  field  is 
not  usually  accessible  by  the  perturbation  expansion  of  the  firs t-quanti2ed 
wave  mechanics  simply  because  the  order  of  perturbation  expansion  usually 
is  not  carried  out  beyond  the  first-nonzero  contribution.  Computerized 
numerical  solutions  for  transition  probabilities  in  various  approaches 
are  given  and  compared  with  experimental  data;  disagreements  between 
theory  and  experiment  are  shown  to  arise  from  certain  inadequacies  of  the 
theoretical  approach.  Suggestions  with  respect  to  the  further  development 
of  the  nonlinear  quantum  optics  of  solids  are  offered.  In  reviewing  the 
research  literature  on  the  multiphoton  absorption  processes  in  gases,  we 
note  extensive  investigations  in  recent  years. Unfortunately,  despite 
numerous  theoretical  and  experimental  contributions,  there  is  a wide  scat- 
ter between  both  the  measured  and  the  predicted  values  of  multiphoton 
absorption*  Because  of  the  many  applications  of  pulsed  lasers,  there  is 
a need  to  identify  theoretical  approaches  which  are  suitable  to  predict 
at  least  the  order  of  magnitude  of  multiphoton  absorption  in  optically 
transparent  materials.  In  the  case  of  gases,  the  ratio  of  the  ionization 
potential  to  the  photon  energy  of  a typical  pulsed  laser  is  rather  high, 
therefore  atomic  photoionization  experiments  are  not  suitable  to  test 
theories  constructed  for  optical  electromagnetic  fields  of  moderate 
intensity.  Specifically  in  what  follows,  we  compare  two  low-order  per- 
turbation treatments  with  a semiclassical  procedure  in  which  the  band 
distortion  due  to  the  electromagnetic  field  is  taken  into  account.  Within 
the  context  of  models  proposed  by  Keldysh, ^ Brauns tein  and  Ockman,^  and 
Basov  et  al.,^>7  we  calculate  second-order  absorption  coefficients  for  a 
number  of  direct-bandgap  semiconductors  and  compare  theoretical  predic- 
tions with  available  experimental  results.  In  section  2,  we  consider  the 
rate  equations  describing  the  nonlinear  absorption.  In  section  3,  we 
outline  a general  formulation  of  the  multiphcton  absorption  within  the 
semiclassical  radiation  theory  in  the  electric  dipole  approximation. 

Section  4 deals  with  the  Keldysh  "exact"  model  which  employs  conduction 
and  valence  band  electronic  wavefunction  "dressed"  by  the  radiation  field. 
Section  5 contains  the  perturbation- theoretic  approaches  of  Brauns tein 
and  Ockman.  Comparison  between  the  theoretically  predicted  and  the  experi- 
mentally determined  values  of  nonlinear  absorption  coefficients  is  given 
in  section  6. 
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2.  ATTENUATION  OF  LIGHT  BY  NONLINEAR  ABSORPTION.  In  a transparent 
material,  residual  absorption  of  light  may  be  related  to  a number  of 
independent  processes  such  as  multiphoton  and  -phonon  excitations,  pres- 
ence of  residual  free  carriers  or  impurities,  creation  of  excitonic 
states,  and  perhaps  phonon  parametric  amplification.  At  sufficiently 
high  intensities  of  the  optical  field,  free  carriers  created  through  the 
elementary  excitation  processes  enumerated  above,  may  contribute  to 
the  time  dependence  of  the  absorption  through  the  free  carrier  inter- 
action with  the  electromagnetic  field  (so-called  inverse  Bremsstrahlung) . 
The  time  dependence  of  this  part  of  absorption  follows  from  the  rate 


equations  describing  the  free  carrier  density  and  shows  typically  transi- 
ents of  10“12  to  10"®  sec  duration  reflecting  the  trapping  and  recombina- 


tion rates  in  solid  materials.  In  these  considerations,  we  will  neglect 


these  transient  contributions. 


For  the  steady-state  description  of  the  nonlinear  absorption,  we 
argue  that  for  a given  material  with  a fixed  bandgap  between  the  valence 
and  conduction  bands  Eg  and  a given  incident  photon  energy  hr,  the 
photon  absorption  on  v-th  order  dominates.  Here  v denotes  the  ratio 
(EgAio  + 1) , and  the  brackets  ( ) stand  for  the  integer  part  of  this 
ratio.  Although  linear  energy  losses  caused  by  nonresonant  processes 
will  always  be  present  even  in  the  purest  materials,  low-intensity 
measurements  can  yield  reliable  values  of  the  linear  absorption  coef- 
ficient. Hence  observed  intensity  loss  due  to  multiphoton  absorption 
should  be  satisfactorily  described  by  the  phenomenological  rate  equation 


dl  _ 
<U  • 


(a^I1  + or  2^  +...+  <*nin) 


(2.1) 


where  A represents  the  spatial  coordinate  along  the  direction  of  travel 
of  the  light  in  the  material,  and  orv  is  the  v-photon  absorption  coefficient. 
The  total  experimentally  observed  intensity  attenuation  should  be  cor- 
rected for  the  contribution  due  to  nonresonant  losses.  The  corrected 
value  provides  the  rate  of  intensity  loss  dl/df  due  to  nonlinear  absorp- 
tion processes. 

In  the  above  rate  equation,  I denotes  the  incident  intensity  (light 
flux  in  units  of  W cm"2) . The  dimension  of  the  v-th  order  absorption 
coefficient  av  is  (length) ^v"3/(power)v"^.  If  the  multiphoton  process 
of  order  v dominates,  we  can  express  the  attenuation  rate  of  the  light 
flux  as 

dl  c V'  . 

TT  = - > .Of  I 


where  c is  the  velocity  of  light  in  vacuum,  and  e0  is  the  high-frequency 
dielectric  constant  of  the  material.  The  relation  between  the  flux  in 
the  material  I [W  cm"2j  and  the  peak  electric  field  amplitude  EQ  [V  cm"l] 
in  practical  electromagnetic  units  is  given  by  I = %(Eq  ,/e^/R0) , where  RQ 
is  the  vacuum  impedance  (377  Q).  The  relation  between  the  flux 
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I [W  cm”  ] and  the  photon  number  density  Np^  [cm  ] is  given  by 
I = (c/  jt)  ho  Np^.  Accordingly,  the  rate  of  photon  absorption  per 
unit  volume  is  expressed  by 


dN  . a 

EJl  = _v  Tv 

dt  hi)  1 


» 


where  Ne  designates  the  number  density  of  free  carriers  created  by  across 
the  gap  ionization  through  the  v-photon  absorption*  The  right-hand  side 
of  the  above  equation  is  also  a consequence  of  the  energy  balance  condition 
since  the  rate  of  the  photon  absorption  must  be  v -times  larger  than  the 
rate  of  the  free  carrier  creation;  hence  dNp^/dt  + v(dNe/dt)  = 0*  We 
also  designate 


dN 
e 

dt 


W(E„  ) 


where  w is  the  multiphoton  transition  rate;  consequently,  the  multi- 
photon absorption  coefficient  for  the  v-th  order  process  is  given  by 


2v  ho(2Ro)V  W(E*V) 

“v  v/2  2v 

00 


(2.2) 


In  eq.  (2.2)  cro  is  the  high  frequency  dielectric  constant,  and  the  factor 
of  2 is  introduced  to  account  for  the  free  carrier  spin  degeneracy.  Note 
that  despite  the  apparent  dependence  of  the  absorption  coefficient  upon 
E0,  the  actual  calculated  numerical  values  of  av  given  in  section  6 are 
field  independent . 

3.  MULTIPHOTON  ABSORPTION  BY  THE  SEMICLASS ICAL  TIME -DEPENDENT 
PERTURBATION . The  theoretical  description  of  multiphoton 
absorption  is  based  on  the  employ  of  second-  and  higher-order  time- 
dependent  quantum  mechanical  perturbation  theory.  In  the  case  of  the 
two-photon  process,  it  has  been  first  shown  by  M.  Goeppert-Mayer^  that 
the  interaction  term  coupling  the  momentum  of  the  electron^  p .with  the 
vector  potential  of  the  Maxwell  field  A,  given  by  (e/mc)  p . A, can  be 
canonically  transformed  to  P . This  transformation  is  valid  when  the 
linear  extent  of  the  interacting  electron-ion  system  is  small  with  respect 
to  the  wavelength  of  the  radiation  rield  (this  condition  is  readily 
satisfied  for  the  electric  dipole  mediated  transitions  into  far  ultra- 
violet wavelengths);  subsequently,  either  of  these  interaction  terms 
can  be  used  in  perturbation  expansion. 

At  first  we  shall  review  the  salient  features  of  the  perturbation 
theory.  Let  there  be  two  eigenstates  of  energy  Ef  and  Ef ; we  are  con- 
cerned with  an  electric  dipole  transition  between  them  induced  by  photons 
of  energy  hi)  < Ef  - Ef . Such  transitions  can  then  only  take  place  by 
the  participation  of  more  than  one  photon.  The  order  of  the  photon  process 
is  obviously  given  by  (1  + (Ef  - Ef)/hu)  where  ( ) indicates  the  integer 
part.  The  probability  of  such  transitions  will  depend 
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on  the  number  density  of  photons  or  the  electric  field  intensity  of  the 
radiation.  Large  field  intensities  will  also  introduce  level  shifts. 


Let  us  consider  the  time-dependent  Schrodinger  equation 

K*  - ill  ||  . (3.1) 

The  Hamiltonian  3C  consists  of  the  unperturbed  time-independent  Hamil- 
tonianKf5  and  a time-dependent  perturbation  term  3C' (r,  t)  . If  \|r°(r) 
defined  by 

5C°*°(r)  - En^°(r)  = 0 (3.2) 

are  the  eigenstates  of  K? , i|r£(r,t)  is  given  by 

*°(r,t)  = t°(r)  exp  [-i  ^ tj  . (3.3) 

The  general  state  function,  which  is  the  solution  of  eq.  (3.1),  can  be 
expressed  as  a superposition  of  states 

♦ <r,t)  an(t)  t°(r»t)  . (3.4) 

n 

The  probability  that  at  any  time  t the  system  is  in  a state  with  energy 
Ejj  is  given  by  |an(t)|2.  Substituting  eq.  (3.4)  in  eq.  (3.1),  and  making 
use  of  eqs.  (3.2)  and  (3.3),  one  obtains 

n' 

Equation  (3.5)  may  be  used  to  solve  ajj  to  any  order.  The  zeroth  order 
coefficients  a^°) (0)  at  the  time  t = 0 are  zero  except  for  the  initial  state, 
for  which 

a£0)  (0)  - 1 . 

The  first  order  coefficient  a^^(t)  is  therefore 

n 

= S'/  <*„lK,l*i>dt'  • <3-6> 

*,0 

To  obtain  the  second  order  correction  to  the  coefficient  an(t),  one 
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substitutes  the  above  value  of  a^^(t)  in  the  right-hand  side  of  eq. 
(3.1)  and  solves  for  a|^(t).  Continuing  this  iterative  process,  one 
can  write  for  a general  n-th-order  correction 


(n) 

af 


(t) 


ilr  ) dt"... 

T m 


n-1 


(3.7) 


The  repeated  indices  of  the  wave  functions  are  the  so-called  intermedi- 
ate states.  With  the  use  of  eq.  (3.3),  eq.  (3.7)  may  be  rewritten  in 
terms  of  the  unperturbed  wave  functions  as 


E - E ± ho.  ...An)  *’* 

, n r 1 n 


<Qk'I*°> 


‘ \± 


ho 


lJ 


/ 


exp 


i(Ef  - 


E.  ± ho,  . . ,iho  )t' 
i 1 n 


dt1 


(3.8) 


For  the  sake  of  generality,  every  one  of  the  n-photons  has  been  assigned 

a different  frequency  u)  / to  . 

n m 

The  transition  probability  rate  may  be  readily  obtained  from  (3.8) 
and  is  given  by 


»-  ,<“)  - £ I 


if 


aif 


(t)| 


(3.9) 


So  w.^  will  contain  square  of  the  expression  in  the  square  bracket  of  eq. 
(3.8^,  and  the  integral  in  eq.  (3.8)  will  introduce  a 6-function 


8(E_  - E.  db  ho , . . .dfcho  ) 
f i 1 n 


to  satisfy  the  energy  conservation  requirement. 

For  optical  transitions  in  a crystalline  solid,  the  initial  and 
final  states  are  obviously  the  valence  and  the  conduction  bands,  respectively. 
The  choice  of  the  intermediate  states  accounts  for  different  results  in 
different  calculational  schemes.  The  order  of  the  transition  equals  the 
number  of  quanta  of  the  perturbing  field  absorbed  or  emitted.  The  inter- 
mediate states  do  not  appear  in  the  5-function  but  in  the  denominator 
of  the  square  of  the  pre-integral  terms  of  eq.  (3.8).  Therefore  the 
contribution  of  the  intermediate  states  to  the  transition  probability 
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becomes  significant  when  they  are  located  close  to  the  initial  or  final 
states. 

Ijj  the  presence  of  a radiation  field  described  by  the  vector  poten- 
tial A , the  kinetic  energy  of  the  electron  is  modified  to 


Hamiltonian  in  eq.  (3.1) 


(3.10) 

and  uu  are  photon  wave 
its  unit  polarization  vector. 
In  eq.  (3.10)  we  have  neglected  the  I A.|  ^ term,  which  is  usually  very 
small  at  the  wavelengths  and  intensities  in  question.  The  amplitude 
&Aq  of  the  vector  potential  is  related  to  the  photon  density  Nph  by 


2m 


which  as  a result  introduces  a perturbation 
of  the  type 


me 


A -4  i 

where  A = A exp  [ifc  • r - cut]  . Here, 
vector  and  frequency,  respectively*  and 


2n  N , he 

vk 


(JUt 


(c«£»s«) 


(3.11) 


CO 


where  is  the  dielectric  constant.  The  calculation  of  the  transition 
probability  rate  then  boils  down  to  the  calculation  of  the  matrix  elements 
of  p • A between  the  valence  band  (or  bands) , the  chosen  intermediate 
levels  and  the  conduction  band.  This  is  essentially  what  Braunstein  et 
al*"*  and  Basov  et  al.^,7  models  pertain  to  do. 


Keldysh  treatment,  however,  differs  from  the  above  scheme  as  it 
_jlso  includes  the  level  shifts  caused  by  the  perturbing  electric  field 
E , which  was  neglected  by  Bijaunsteiin  and  Basov, The  energy  shift  of 
a level  induced  by  the  field  E - ^ E exp  [i(k  • r - cut)]  is  given 

by  ° 


(3.12) 


where  the  summation  extends  to  all  other  states  of  the  system.  In  the 
case  of  a semiconductor,  these  level  shifts  account  for  the  change  in 
band gap  in  the  presence  of  a field,  which  in  the  cas^  of  static  fields 
is  fairly  well  known,  and  is  termed  the  Franz-Keldysh  effect.  Keldysh's 
treatment  for  the  multiphoton  transition  probability  between  shifted 
levels  incorporates  the  dynamic  Franz-Keldysh  effect,  and  will  be  described 
in  section  4. 
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4.  THE  KELDYSH  "DRESSED  STATE"  MODEL.  Keldysh4  treated  the  multi- 
photon  absorption  as  the  high  frequency  limit  of  the  time-dependent 
tunneling  induced  by  the  oscillatory  electric  field  of  the  laser  radia- 
tion. In  other  words,  a unified  description  is  available  for  the  auto- 
ionization process  under  the  influence  of  a strong  low-frequency  field 
and  for  the  multiphoton  ionization  induced  by  a strong  high-frequency 
field*  Keldysh1 s treatment  applies  to  both  isolated  atoms  and  crystal- 
line solids  (e*g*,  semiconductors)*  In  both  cases  Keldysh  calculates 
the  probability  of  direct  electron  excitation  from  the  ground  state  (or 
valence  band)  to  the  continuum  (or  conduction  band)  and  the  excitation 
probability  through  intermediate  states  of  the  discrete  spectrum*  The 
intermediate  states  could  be  higher  excited  states  of  the  Coulomb  field 
of  isolated  atoms,  or  impurity  levels  and  exci tonic  states  in  crystal- 
line solids* 

Keldysh  considers  a system  described  by  a Hamiltonian  of  the  form 

K = 3C0  +KC  + 3^  (4.1) 


where  is  the  kinetic  energy  term,  Kq  is  the  Coulomb  contribution  to 
the  potential  energy  (or  the  crystal  field  in  solids),  and  is  the 
interaction  energy  of  the  bound  charge  in  a periodic  electric  field 
E(t)  = E0  cos  cot*  The  electron  is  assumed  to  be  initially  in  the  un- 
perturbed Coulomb  ground  state  given  by  the  wave  function 


tQ(r»t)  = i|ro(r)  exp  (-T^t/h) 

■ exp  (-r/ao)  exp  (-i<SQt/h)  (4.2) 

<"4> 


2 2 

where  a = h /|ne  z) is  the  z-charged  atom  Bohr  radius  and  its  ioniza- 
tion potential. 

In  the  case  of  a crystalline  solid,  the  Bloch  wave  function  is  modi- 
fied by  the  presence  of  the  electromagnetic  field  as  follows.  (In  a sense, 
the  bare  electron  state  is  "dressed"  by  the  electric  field  component  of 
the  e.m.  field.) 

♦pV^(r,t)  - up ( t) & exP  jh  (p^  * * -/  drEv[p(T)]^|  (4*3) 

where  . 

-¥ 

-»  -»  eE 

P(t)  - p + pp  sin  cot  . (4.4) 


The  amplitude  functions  u 


(v) 

P(t) 


(r) 


correspond  to  a valence  electron  with 
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momentum  p(t)  and  have  the  translational  symmetry  of  the  lattice.  The 
principal  difference  between  the  traditional  treatment  of  this  problem 
and  the  Keldysh  approach  is  that  he  recognizes  the  modification  of  the 
unperturbed  bands  under  the  influence  of  the  field.  T£is  modification 
amounts  to  a replacement  of  the  unperturbed  momentum  p with  the  time- 
dependent  momentum  given  by  eq,  (4.4).  The  excited  electronic  states 
associated  with  the  continuum  of  an  atom  or  in  the  conduction  band  of 
a solid  are  assumed  to  be  unaffected  by  the  electrostatic  Coulomb  energy, 
but  are  modified  by  the  perturbing  effect  of  the  electromagnetic  field. 
Thus,  in  the  case  of  isolated  atoms,  the  ionized  electron  is  described 
by  the  solution  of  the  Schrodinger  equation 


(4.5) 


(4.6) 


For  the  case  of  a solid,  the  conduction  electron  is  described  by  the 
Bloch  function 


tpC^(r,t)  = u^)(r)  exp  j“  Cp(t)  . r - j dTEc[p(T)]j  . (4.7) 

One  again  notices  that  the  wave  function  of  the  final  state  is  also  dressed 
by  the  perturbing  action  of  the  field.  As  a consequence,  the  electronic 
transition  does  not  occur  between  unperturbed  states,  but  rather  between 
nonstationary  states  in  which  the  electron  acceleration  due  to  the  field 
is  taken  into  account.  Finally,  the  transition  probability  rate  is  cal- 
culated according  to  first  order  perturbation  theory.  The  calculated 
transition  probability  is  then  summed  over  all  possible  final  momenta 
of  the  quasi-free  electron. 

In  order  to  arrive  at  an  explicit  expression  for  the  transition 
probability,  Keldysh  used  the  following  parabolic  energy-band  relation 
for  a solid 


(4.8) 


where  m*  designates  the  reduced  effective  mass  of  the  electron-hole  pair 
1/m*  = 1/m*  + 1/%.  Eg  denotes  the  width  of  the  bandgap,  and  _£he  momen- 
tum and  k-space  representations  are  connected  by  the  relation  p = hk. 
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In  the  limiting  case  when  the  parameter 


Y 


(2m*E  )' 
g 


(4.9) 


is  much  larger  than  unity  (a  condition  which  is  readily  satisfied  for 
most  of  crystalline  solids),  Keldysh* s transition  rate  (electronic  transi- 
tion probability  per  unit  volume  and  per  unit  time)  is  given  by 


v = £u,  (^“)2![(2<x  + 1)  - 2x)^] 

v(x+l) 


A2 


2 

16m%  E 


exp 


,g> 


2<x  + 1)  1 - 


2 2 
e E 

o_ 

2 

E 


g/ J 


(4.10) 


The  meaning  of  symbols  in  eqs,  (4.9)  and  (4.10)  is  given  below: 
y ■*  6.45  x 10^  (m*Eg)'2/(\E0) , where  m*  is  given  in  units  of  electron 
mass.  Eg  in  eV,  X -wavelength  of  incident  light  in  vacuum  (inp,m),  E0 
is  the  electric  field  amplitude  in  the  material  in  units  of  MV  m“*,  v 

-z2  z y2  / 2 2 , 2 \ 

5(2)  = e Jo  e;  dy  is  the  Dawson  integral,  x = Eg/foall  4-  e E0/(4m%  Eg)l  , 

and  (...)  is  the  integer  part  of  the  argument.  ' ' 

Up  to  five  photon  transition  probability  rates  have  been  calculated 
by  means  of  relation  (4.10)  for  a number  of  semiconductors  as  function 
of  wavelength  of  light  at  a given  field  intensity  of  10  MV  m"l  in  the 
material.  The  relevant  band  masses  and  bandgaps  used  in  these  calcula- 
tions are  listed  in  Table  1. 


Table  1.  Material  parameters  used  in  calculation  of  multiphoton  transi- 
tion rates  as  function  of  energy  of  incident  laser  light  shown  in 

figs.  1 to  6. 


Semiconductor 

Band gap 
(eV) 

Effective  mass 

ratio 

' Electron 

Hole 

Reduced 
Pair  mass 

t 

GaAs 

1.53 

0.068 

0.5 

0.06 

GaSb 

0.8 

0.047 

0.5 

0.043 

InAs 

0.46 

0.02 

0.41 

0.019 

PbS 

0.34 

0.66 

0.5 

0.364 

InSb 

0.26 

0.013 

0.6 

0.0127 

PbTe 

0.24 

0.22 

0.29 

0.125 

The  log  w vs  photon  energy  plots  are  given  in  figs.  1 to  6.  Num- 
bers written  above  each  plateau  of  the  transition  rate  indicate  the  integer 
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23 


-3  -1 

Fig.  1.  GaAs  multiphoton  transition  rate  (?.-nd  to  4-th  order)  in  cm  s induced 
by  a constant  optical  electric  field  of  10^  V cm"1  amplitude  and  varied  frequency. 


Fig.  2.  GaSb  multiphoton  transition  rate. (1-st  to  4-th  order)  in 
cm^  s"1  induced  by  a constant  optical  electric  field  of  10^  V cm“^ 

amplitude  and  varied  frequency. 
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LOG  TRANSITION  RATE  (cm 


PHOTON  ENERGY  (W) 

Fig.  3.  InAs  multiphoton  transition  rate  (1-st  to  4-th  order) 
in  cm”3  s-1  induced  by  a constant  optical  electric  field  of 
104  V cm"-*-  amplitude  and  varied  frequency. 
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LOO  TRANSITION  RATE  (cm 


PHOTON  ENERGY  (eV) 

Fig,  6.  PbTe  multiphoton  transition  rate  (1-st  to  4-th  order)  in 
cm“3  g-1  induced  by  a constant  optical  electric  field  of  lO^  V cm-* 

flinolitude  and  varied  frequency. 
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value  of  (Eg/bv  + 1).  Apart  from  quantitative  differences,  the  qual- 
itative functional  dependence  in  figs.  1 to  6 is  strikingly  similar  in 
that  a quasi-resonant  behavior  at  integer  E„/hu  values  is  displayed. 

This  reminds  us  of  one  of  the  most  obvious  features  of  the  multiphoton 
transition  probability  calculated  by  Bebb  and  GoldlO  for  hydrogen-like 
atoms  which  shows  the  distinctly  resonant  dependence  of  the  excitation 
probability  as  a function  of  the  incident  photon  energy.  While  the 
energy  levels  of  isolated  atoms  are  clearly  quite  different  from  the 
bands  of  semiconductors  considered  in  Keldysh's  model,  a quasi-resonant 
dependence  of  the  transition  rate  on  the  incident  photon  energy  seen  in 
our  calculations  is  qualitatively  correct. 

11 

It  should  be  also  remarked  that  Weiler  et  al.  have  extended 
Keldysh's  calculation  to  include  the  effects  of  longitudinal  or  trans- 
verse magnetic  fields  on  the  interband  electronic  transition.  Weiler' s 
conclusions  are  in  agreement  with  Keldysh's  results  for  zero  applied 
magnetic  field.  One  expects  a series  of  edge  absorptions  of  the  form 
— (vhtu  - Eg)^  where  v is  the  photon  multiplicity.  Such  a behavior 
is  indeed  found  in  the  negative  slope  of  the  transition  rate  after  each 
quasi-resonant  "peak"  as  seen  in  figs.  1 to  6.  It  has  been  argued  that, 
at  best,  Keldysh's  theory  should  provide  acceptable  results  only  for 
processes  of  fairly  high  photon  multiplicity  and  that  its  application 
to  two-photon  absorption  processes  should  not  yield  more  than  qualita- 
tive agreement  with  the  experimental  data.  Recent  experiments^  involv- 
ing four-photon  transition  in  ZnS  are  at  variance  with  Keldysh's  pre- 
diction, while  they  are  in  fair  agreement  with  perturbative  semiclassical 
calculations.-^  However,  a good  agreement  with  three-photon  absorption 
in  CdS  has  been  found.  C.  H.  Lee  measured^  the  three-photon  absorption 
coefficient  0*3  = 1.3  . 10"^  cm^/GW^  for  single-crystal  CdS.  Our  calcu- 
lations based  on  the  evaluation  of  eqs.  (2.2)  and  (4.10)  yielded  a value 
of  <23  = 0.2  • 10“2  cm-^/GW^.  We  have  used  Eg  = 2.42  eV,  = 5.32  and  ^ * 

m*  = 0.192  as  CdS  material  parameters  in  this  calculation.  A more  syste- 
matic comparison  for  the  two-photon  absorption  coefficient  is  presented 
in  section  6.  An  additional  feature  of  the  Keldysh  model  was  found 
(which  was  rather  unexpected)  that  it  predicts  very  well  the  one-photon 
absorption  coefficient  near  the  band  edge  for  two  thus  far  analyzed  semi- 
conductors, GaAs  and  InSb.  It  is  known  that  near  the  fundamental  absorp- 
tion edge,  the  one-photon  absorption  coefficient  can  be  expressed  as 

al  * ACbu>  - Eg)Y  (4.11) 

where  til)  is  the  photon  energy,  and  y is  a constant  which  equals  1/2 
and  3/2  for  allowed  direct  transitions  and  forbidden  direct  transitions, 
respectively.  We  specialize  eq.  (4.11)  for  the  case  fou  > Eg.  In  addi- 
tion, the  exponent  y equals  2 for  indirect  phonon  assisted  transitions 
and  1/2  for  allowed  indirect  transitions  to  exciton  states. 

Near  the  absorption  edge,  where  the  values  of  (far  > Eg)  become  com- 
parable to  the  binding  energy  of  the  exciton,  the  Coulomb  interaction 
between  the  free  hole  and  the  electron  must  be  taken  into  account.  For 
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ha  < Eg  the  absorption  merges  continuously  into  that  due  to  the  higher 
excited  states  of  the  exciton;  when  hi)  » Eg,  higher  energy  bands  will 
participate  in  the  transition  process  and  complicated  band  structures 
will  be  reflected  in  the  absorption  coefficient.  The  least  square  fit  of 
absorbance  vs.  photon  energy  curves  for  photon  energies  near  the  band 
edge  of  GaAs  yields  the  following  values  for  A and  y : A = 44  . 

10-*,  v = 0.499,  and  A * 44.7  . 10  , y = 0.505  from  Moss  and  Hawkins'1 
and  from  Sturge's^  measurements,  respectively.  Numerical  evaluation 
of  eqs.  (2.2)  and  (4.10)  for  the  one-photon  transition  case  v = 1 can 
be  expressed  by  the  functional  dependence  shown  in  eq.  (4.11)  with  A = 

44  • 10^  and  y = 1/2.  Hence,  the  Keldysh  model  besides  giving  the 
correct  value  of  the  absorption  coefficient  at  the  band  edge  of  GaAs 
describes  very  well  its  wavelength  dependence  also.  This  agreement 
is  seen  in  fig.  7 where  unadjusted  calculated  absorption  coefficients 
are  compared  with  Moss  and  Hawkin' s experimental  values  for  photon 
energies  between  1.42  to  1.48  eV.  GaAs  parameters  used  in  calculating 
the  theoretical  absorption  constant  of  fig.  7 are  listed  in  Table  2. 


Table  2.  List  of  parameters  for  GaAs  used  in  the  theoretical  fit  of  fig. 
7.  The  values  for  reduced  effective  mass  m*  and  dielectric  constant 
have  been  obtained  from  [18]  and  [19],  respectively. 


Data  source 

E [eV] 
g 

m* 

e 

QD 

[15]  o 

1.403 

0.059 

10.9 

[16]  (21°K) 

1.521 

0.059 

10.9 

[16]  (294°K) 

1.435 

0.059 

10.9 

For  the  comparison  of  theoretical  and  experimental  one-photon  absorption 
coefficient  of  InSb,  the  experimental  data  points  have  been  taken  from 
fig.  3 of  ref.  20.  The  empirical  equation  which  fits  the  data  around 
the  band  edge  is  listed  in  Table  3 together  with  the  reduced  effective 
mass  and  the  high  frequency  dielectric  constant  which  are  needed  for  the 
theoretical  comparison. 

Table  3.  Empirical  relation  for  the  absorption  coefficients  of  InSb 
reported  in  [20] . The  bandgap  energy  Eg  has  been  calculated  by  a least 
square  fitting  procedure.  The  reduced  effective  mass  m*  and  the  high- 
frequency  dielectric  constant  have  been  obtained  from  [19]  and  [21], 

respectively. 


O'  = 2.026  . 104  (ho  - E )^[cm  ^] 
hi), Eg  in  eV 
E = 0.2248  [eV] 

8 _ OQ 

m*  = 0.0113  m ; m = 9.108  • 10  [g] 

e - 15.7 

» 


In  fig.  8 we  have  plotted  the  experimental  data  together  with  the  results 
of  the  numerical  evaluation  of  eqs.  (2.2)  and  (4.10).  We  note  that  the 
quantitative  agreement  between  the  Keldysh  theory  and  the  measured  data 
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is  rather  good,  particularly  if  we  consider  the  large  range  of  incident 
photon  energies  included  in  the  comparison.  A further  discussion  on  the 
agreement  of  Keldysh  model  with  the  second-order  perturbation  theory  is 
given  in  section  6. 

5.  SECOND-ORDER  PERTURBATION  MODELS.  In  this  section,  we  consider 
two  second-order  perturbation  models  first  proposed  by  Braunstein  and  Ockmah 
and  by  Basov  et  al.6>7  ^s  explained  in  section  3,  in  the  second-order  per- 
turbation an  intermediate  state  is  required  to  complete  the  transition  from 
the  initial  to  the  final  state  of  the  system  perturbed  by  the  radiation 
field.  A proper  accounting  of  intermediate  states  becomes  important  when 
their  energy  eigenstates  are  close  to  initial  or  final  states  of  the  sys- 
tem, or  if  they  coincide  with  other  real  states  encountered  in  practical 
materials  such  as  exciton  or  impurity  states.  Therefore,  the  agreement 
between  the  calculated  and  the  observed  nonlinear  absorption  coefficients 
will  depend  largely  upon  the  inclusion  of  the  appropriate  details  of  the 
energy  band  structure. 

Braunstein  and  Ockman^  consider  vertical  transitions  between  unper- 
turbed parabolic  bands.  They  assume  that  the  only  significant  intermediate 
state  is  a higher  conduction  band  designated  with  n in  Fig.  9. 


E 


Fig,  9.  The  interconduction  band 
transition  model  of  Braunstein; 
v , c , and  n refer  to  the 
valence,  lowest  conduction,  and 
intermediate  conduction  bands, 
respectively.  All  energy  bands 
are  taken  to  be  parabolic  in  the 
k-space. 


Second-order  perturbation  theory  yields  the  transition  rate  per  unit 
volume  for  vertical  transitions  at  a given  value  of  k ; subsequently, 
the  k-dependent  probability  is  integrated  over  all  values  of  k . For 
allowed-allowed  transitions,  the  result  is 


w = 


2|v  | 2 

nc  ^vn1  m' 

72 


K l V 

8uiv 


1 (2ho  - E )*  / 

S-y  L 

(*  + O2  V 


a + a 

AE  - ho  + n — — - (2ho 
ac  + av 


v) 


-2 


(5.1) 


In  eq.  (5.1),  3Cnc  and  JC^  are  the  matrix  elements  of  the  interaction 
Hamiltonian  between  the  conduction  c and  the  valence  v bands,  and  the 
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intermediate  conduction  band  n , respectively.  The  free  electron 
mass  is  designated  with  m , and  a^  (i  ■ c,v,n)  is  the  reciprocal  effec- 
tive mass  ratio  m/m?  for  the  i-th  band,  a^  = m/m*.  The  energy  gap  AE 
is  the  energy  difference  between  the  top  of  the  heavy  hole  band  (the  only 
valence  band  included  in  this  calculation)  and  the  bottom  of  the  inter- 
mediate conduction  band. 

The  interaction  Hamiltonian  is  taken  to  be  of  the  form 


1 . 


We  will  evaluate  the  matrix  elements  of  the  interaction  term  by  the  so- 
called  k » p method. 22  in  this  method,  the  matrix  elements  of  the  effec- 
tive Hamiltonian  are  given  by 


JC  f = [Ei(V  + SI  (^2  “ + m (^  ' ^o)  * P/  i 

6 jo  2m  o j,£  m o j 


(5.2) 

Consider  a solid  consisting  of  two  energy  bands  denoted  by  _^0  and  1 
each  having  an  extremurp  in  the  reciprocal  lattice  space  at  kQ  = 0 and  at 
corresponding  momenta  pq^  m p^Q  = P taken  to  be  isotropic.  We  choose  the 
zero  of  energy  axis  to  be  at  the  top  of  the  lower  band  0 such  that 
E0(k0)  = 0 and  E^(k0)  = Eg  for  the  upper  band,  Eg  being  equal  to  the  for- 
bidden gap  width.  These  assumptions  restrict  our  treatment  to  solids  with 
allowed  direct  transitions  between  the  two  bands.  The  Hamiltonian  operator 
is  then  given  by  a 2 X 2 matrix  shown  below 


hV 

2m 


K = 


m 


k . p 


hV 
Eg  + ir 


(5.3) 


Its  diagonalization  yields  the  following  eigenvalues  of  the  matrix  (5.3) 


En  - (k)  - E /2  + h2k2/2m  ± J E2/4  + hW 
0,1  g 1 g r0,l 


(5.4) 


For  small  values  of  k , the  square  root  in  eq.  (5.4)  can  be  expanded 
in  a power  series  about  Eg/2.  Retention  of  first-term  only  yields  the 
following  expression  for  the  lower  energy  band 


E0(k)  = 


2m 


[1 


. aiiii 

mE  J 

g 


(5.5) 


and 
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for  the  upper  energy  band.  The  effective  reciprocal  mass  tensor  of  the 
carrier  in  n-th  band  is  defined  by 


(m/m£) 


3^  (k) 
m n 

,2  9k  3k 

h as 


(5.7) 


Because  of  the  assumption  of  isotropic  matrix  elements  in  the  Hamiltonian 
(5.3)  of  the  two-band  model,  eqs.  (5.5)  to  (5.7)  yield  scalar  reciprocal 
effective  masses  for  free  carriers,  [m/m*]  * 1 - 2p^/(mE„),  and 

2 o 

[m/nt|]  = 1 + 2p  / (mEg)  for  the  lower  and  the  upper  energy  bands,  respec- 
tively. We  take  the  lower  band  0 to  be  the  highest  valence  band  and 
the  upper  band  1 the  lowest  conduction  band  of  a solid  with  a direct 
energy  bandgap.  The  effective  reciprocal  masses  of  holes  h in  the 
valence  v and  electrons  e in  the  conduction  c bands  are  thus  given 
by  the  scalar  relations 


[m/m*]  = 1 - 2p2/ (mEg) 


(5.8) 


and 


[m/m*]  = 1 + 2p  /(mEg) 


(5.9) 


respectively. 

The  band  curvatures  of  the  valence  and  the  conduction  bands  are  of 
opposite  signs  which  accounts  for  the  opposite  signs  of  the  p^  term  in  the 
eqs.  (5.8)  and  (5.9).  Keeping  in  mind  the  opposite  curvatures,  we  can 
rewrite  eqs.  (5.8)  and  (5.9)  in  absolute  values  to  yield 

| m/m*|  = 2p2/ (mE  ) - 1 (5.8a) 

h § 


and 


| m/m*(  = 2p2/ (mE  ) + 1 . (5.9a) 

® § 

2 

From  (5,8a)  and  (5.9a),  one  readily  obtains  m/m*  « 4p  /(mE*)*  The  energy- 
momentum  relation  needed  to  evaluate  the  matrix  elements  or  the  interac- 
tion Hamiltonian  in  terms  of  the  one-electron  effective  mass  approximation 
of  the  solid  state  theory  is  therefore  given  by 


(5.10) 
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£-»-£(  JE.) 

2m  8 Sn*; 


which  is  quite  satisfactory  for  small  values  of  k , We  proceed  in 
evaluating  the  matrix  elements  of  eq.  (5.1)  with  the  help  of  eq.  (5.10) 
and  substitute  the  transition  rate  into  eq.  (2.2)  for  the  case  v = 2. 
The  second-order  nonlinear  absorption  coefficient  is  given  by 
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(5.11) 


The  fnc  and  f^  denote  the  oscillator  strengths  for  the  transitions  medi- 
ated by  and  K^c>  respectively.  In  numerical  calculations  given  in 
section  6,  it  is  taken  that  ^nc  ^vn  = 1.  All  parameters  in  eq.  (5.11) 
are  expressed  in  c.g.s.  electrostatic  units. 

Equation  (5.11)  has  been  specialized  for  the  case  of  single  photon 
beam  of  frequency  0)  in  the  material  (rather  than  two  beams  at  two 
different  frequencies  (JUt  and  102).  Braunstein  and  Ockman's  expression 
for  the  nonlinear  absorbance  given  by  eq.  (7)  of  ref.  5 must  be  multiplied 

by  a factor  of  l/(e^)(  m/(2m*) j ^ to  obtain  our  eq.  (5.11).  The  first  term 

in  the  corrective  factor  is  due  to  the  different  formulation  of  absorption 
coefficient  in  our  eq.  (2.2)  as  compared  with  eq.  (2)  of  ref.  5,  and  the 
second  term  arises  because  we  evaluated  the  matrix  elements  of  allowed 
transitions  in  the  k . p approximation  rather  than  by  employing  eq.  (4) 
of  ref.  5.  Our  eq.  (5.11)  includes  appropriate  factors  for  spin  orienta- 
tion and  photon  multiplicity. 


A generalization  of  Braunstein1 s calculation  for  anisotropic  energy 
bands  has  been  given  by  Hassan.^3  His  results  agree  with  our  eq.  (5.11) 
in  the  limit  of  zero  anisotropy. 

6 7 

In  Basov  et  al.  * model,  the  two  highest  valence  bands  vj  and 
and  the  lowest  conduction  band  c are  coupled  by  the  radiation  field* 
Furthermore,  the  intermediate  states  are  conduction  (or  valence)  intraband 
states  as  shown  in  Fig.  10* 
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Fig,  10,  The  energy  band  model 
employed  by  Basov  et  al.  Para- 
bolic energy  bands  v^,  V2  refer 
to  the  bound  electrons  in  valence 
band,  and  c is  the  conduction 
band.  Intermediate  levels  are 
provided  by  intraconduction  and 
intravalence  states.  Only  intra- 
conduction band  transitions  are 
explicitly  indicated. 


Second  order  perturbation  calculation  of  this  model  leads  to  the 
following  expression  for  the  second  order  absorption  coefficient 


or 
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_j»here  e is  the  unit  polarization  vector  of  the  incident  radiation, 
p is  the  momentum  operator  of  the  electron  and  the  index  i = 1,2  refers 
to  the  initial  state  of  the  electron  in  the  valence  bands.  Contrary  to 
Basov  et  al.,  we  use  eq.  (5.10)  to  evaluate  the  momentum  matrix  elements, 
and  furthermore  we  average  over  all  directions  in  the  k-space.  The 
total  absorption  coefficient  must  contain  summation  over  both  valence 
band  contributions,  <*2  = EjOfji).  Since  each  valence  band  contributes 
about  equally,  we  need  to  calculate  only  ot^  . Therefore  values  listed 
in  Table  7 of  section  6 must  be  multiplied  by  a factor  close  to  2 to 
obtain  the  total  second-order  absorption  coefficient  in  this  model. 

In  our  eq.  (5.12),  factor  of  2 has  been  introduced  to  account  for 
the  spin  degeneracy.  This  factor  has  been  omitted  in  the  original  work 
by  Basov  et  al.  Further  differences  between  our  eq.  (5.12)  and  Basov 
et  al.  expressions  arise  because  of  an  incorrect  numerical  factor  used 
in  relating  the  flux  intensity  with  the  magnitude  of  vector  potential 
in  ref.  6.  This  gives  a multiplicative  correction  of  1/16.  Averaging 
the  scalar  product  | e . p|  2 over  all  directions  in  the  k-space  yields 
another  multiplicative  correction  of  1/3.  Hence,  Basov  et  al.  expres- 
sion for  the  second-order  absorbance  in  the  context  of  eq.  (6)  of  ref. 

6 must  be  multiplied  by  a factor  of  1/48. 
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This  has  been  noted  previously  by  Lee  and  Fan  and  Fossum  and 
Chang. 21  The  need  for  this  correction  does  not  arise  in  connection 
with  our  eq.  (5.12)  because  our  absorption  coefficient  a 2 is  intensity- 
independent  and  it  was  calculated  from  our  eq.  (2.2)  rather  than  eqs. 

(6)  and  (8)  of  ref.  6,  and  the  averaging  over  the  k-space  has  been  also 
performed  in  the  calculations  listed  in  section  6.  A further  difference 
between  Basov  et  al.  work  and  our  eq.  (5.12)  arises  from  our  use  of  the 
lc  . p method  in  evaluating  the  matrix  elements  of  the  interaction  opera- 
tor. Because  of  eq,  (5.10) j we  evaluate 

! <e  * P>_  I 


m*/m  in  connection  with 
eq.  5 and  the  relation  p2/2m  = 3Eg/4m*  in  the  experimental  results  sec- 
tion of  ref.  6.  In  section  6,  we  report  the  numerical  evaluation  of  eq. 
(5.12). 

6.  COMPARISON  BETWEEN  CALCULATED  AND  MEASURED  ABSORPTION 
COEFFICIENTS . In  this  section,  we  compare  absorption  coefficients  cal- 
culated from  eq.  (2.2)  with  available  experimental  data  for  the  second- 
order  nonlinear  absorption.  First,  we  evaluate  the  transition  rates  for 
photon  multiplicities  v = 1 to  3 in  the  perturbed  valence  and  conduction 
band  wavefunction  model  of  Keldysh,  eq.  (4.10).  For  the  second-order 
calculation,  we  also  use  the  perturbation-theoretic  approaches  for  the 
two  different  band  models,  eqs.  (5.11)  and  (5.12). 

In  the  Keldysh  model  calculation,  it  is  necessary  to  evaluate  the 
Dawson  integral  numerically  by  using  the  trapezoidal  rule  of  integration. 

In  general,  the  accuracy  of  the  numerical  integration  is  of  the  order  of 
a few  parts  in  10^.  If  it  is  desired  to  increase  the  accuracy  of  the  inte- 
gration, increase  the  number  of  intervals  N in  the  program  line  260. 

The  computation  time  will  increase  accordingly. 

The  computer  program  listed  in  Table  4 has  been  written  in  a modified 
BASIC  programming  language  for  the  HP  9830A  calculator  equipped  with  ROM’s 
containing  the  mathematics  package  and  the  plotter  control.  This  program 
(less  lines  1000  and  above,  which  contain  the  calculation  of  the  absorp- 
tion coefficient),  with  an  appropriate  modification  of  the  plotting  rou- 
tine, has  been  also  employed  in  plotting  Figs.  1 to  6 of  section  2. 


m 


E /4m* 
g 


whereas  Basov  et  al.  use  the  relation  p /2m  = Ec 
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Table  4.  HP  9830A  computer  program  used  to  calculate  and  to  plot  the  wave- 
length dependence  of  absorption  coefficients  of  the  order  1 to  3 from  eqs. 
(4.10)  and  (2.2).  The  computer  system  consists  of  extended  memory  core 
HP  9830A,  built-in  mathematics  and  plotter  control  ROM's,  the  HP  9866A 
printer  and  the  HP  9862A  plotter.  Instructions  for  use  are  contained  in 

lines  30  to  255. 


30  REM  THIS  PROGRAM  CALCULATES  THE  TRANSITION  RATE  PREDICTED  BY  KELDYSH 
40  REM  AND  PLOTS  VS  PHOTON  ENERGY  IN  ELECTRON  VOLTS;  KEY  IN  RUN  AND  FOLLOW 
INSTRUCTIONS 

42  REM  IT  ALSO  PLOTS  ABSORPTION  COEFF.  VS  PHOTON  ENERGY  IN  ELECTRON  VOLTS 
50  REM  NOTE;  AT  LEAST  ONE  INCREMENT  MUST  BE  ASSIGNED 

70  PRINT  "INPUT  PLOTTER  SCALE,  XMIN,  XMAX,  YMIN,  YMAX" 

71  PRINT 

72  PRINT 

80  INPUT  SI,  S2,  S3,  S4 
90  SCALE  SI,  S2,  S3,  S4 
95  XAXIS  S3,  S2/10,  SI,  S2 
100  YAXIS  SI,  S4/2,  S3,  S4 
115  FLOAT  3 

120  PRINT  "ENTER  GAP  IN  EV,  LAMBDA  IN  MICRONS" 

130  PRINT  "ENTER  EL.  FIELD  IN  UNITS  OF  MEGAVOLTS/M,  MASS  IN  EL.  MASS" 

140  PRINT 

150  INPUT  G,  L,  E,  M 
160  PRINT 

170  PRINT  "IF  YOU  WANT  TO  INCREMENT  GAP,  KEY  IN  DELTAGAP;  IF  NOT  KEY  IN  ZERO" 
180  PRINT 
190  INPUT  Gl 

200  PRINT  "IF  YOU  WANT  TO  INCREMENT  LABMDA,  KEY  IN  DELTALAMBDA;  IF  NOT,  KEY 
IN  ZERO 
210  PRINT 
220  INPUT  LI 

230  PRINT  "IF  YOU  WANT  TO  INCREMENT  EL.  FIELD,  KEY  IN  DELTAFIELD;  IF  NOT, 
ZERO" 

240  PRINT 
250  INPUT  El 

252  PRINT  "TO  PLOT  ABSORPTION  COEFF.  INPUT  DIELECTRIC  CONST  (E2)  IF  NOT, 

ZERO" 

253  PRINT 

254  PRINT 

255  INPUT  E2 
260  N-20 

270  PRINT  "GAP=";G,"LAMBDA=";L 

280  PRINT  "EL.  FIELD=";E,"MASS=";M 

290  PRINT  "Gl="Gl;"Ll="Ll;"El="El 

295  PRINT  "Sl="Sl;"S2="S2;"S3="S3;"S4="S4 

297  PRINT  "E2="E2 

300  PRINT 

310  PRINT 

320  A=8.76057E+36*(M/L)tl.5/L 
330  B=1.23752E-08*(E*L)t2/(M*G) 

340  X=0 .8060 15*G*L* ( 1+B ) 
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350  Rl=INT(X+l) 

360  Z= (2*Rl-2*X) 1 0 . 5 

370  Wl-A* (B/4) tRl*EXP (2*R1* (1-B) -Z*Z) 

380  Y=Z/N 

390  P=(1+EXP(Z*Z))*Z/(2*N) 

400  F=0 

410  FOR  I«1  TO  N-l 
420  F=F+EXP((I*Z/N)t  2) 

430  NEXT  I 

440  W2=P+F*Z/N 

450  W=W1*W2 

460  IF  G1=0  THEN  520 

470  PLOT  G.LGTW 

/a90  IF  G>5  THEN  630 

500  G=G+G1 

510  GOTO  320 

520  IF  UK)  THEN  570 

524  IF  E2#0  THEN  530 

525  PLOT  1. 2395/L, LGTW 
527  IF  E2-0  THEN  540 
530  GOSUB  1000 

540  IF  I>21  THEN  630 

550  L=L+Ll 

560  GOTO  320 

570  IF  E1=0  THEN  630 

580  PRINT  E,LGTW 

600  IF  E>50  THEN  630 

610  E=E+El 

620  GOTO  320 

630  END 

1000  X=INT (G/ (1 . 2395/L) )+l 

1010  A=2*X*1.2395/L*1.602E-19*754tX*W/(SQRE2tX*(E*lE+04)t (2*X)) 
1020  PLOT  1. 2395/L, LGTA 
1030  RETURN 


GaAs  absorption  coefficient  for  photon  processes  from  1 to  4 order  calcu- 
lated with  the  program  listed  in  Table  4 is  shown  in  Fig.  11.  The  material 
parameters  used  in  this  calculation  are  listed  in  Table  5.  Experimental 
data  for  the  absorption  coefficient  of  GaAs  are  available  for  the  first- 
order  and  second-order  transitions  only;  they  are  entered  as  a dot,  a 
vertical  bar,  and  a circle  in  Fig.  11.  It  is  seen  that  the  agreement 
at  1.5  eV  (one-photon  linear  absorption)  and  0.94  eV  (two-photon  nonlinear 
absorption)  between  the  Keldysh  model  prediction  and  the  experiment  is 
quite  good.  The  sharp  decline  in  absorption  at  the  increase  of  the  order 
of  the  photon  multiplicity  will  be  moderated  by  the  contribution  from 
the  excitonic  states.  Since  the  calculation  does  not  include  exciton 
effects,  one  can  expect  that  the  experimental  values  will  not  agree  with 
the  predictions  in  the  range  of  photon  energies  up  to  0.1  eV  less  than 
that  of  the  absorption  edge  for  a particular  order  of  photon  multiplicity. 
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LOG  ABSORPTION  COEFFICIENT  a 


Fig,  11.  Multiphoton  absorption  coefficient  of  GaAs  at  room  temperature  calculated 
from  eqs.  (4,10)  and  (2,2)*  See  Table  5 for  material  parameters  used  in  the  calcula- 
tion, The  dimension  of  the  absorption  coefficient  is  given  in  units  of  cip2v*3  wl-y, 
with  v designating  the  order  of  the  absorption  process.  Experimental  data  are 
shown  for  1,5  eV  photon  energy  [16]  (dot),  1*17  eV  photon  energy  [6,  25  cq  28]  (ver^ 
tical  bar),  and  0.94  «V  [29  and  30]  (circle). 
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Furthermore,  impurities  giving  rise  to  energy  levels  within  the  forbidden 
gap  would  also  increase  the  absorption  coefficient.  Undoubtedly,  they  also 
contribute  to  the  large  scatter  of  experimental  values  seen  at  1.17  eV  in 
Fig,  11,  Because  none  of  these  contributions  are  accounted  for  in  the 
Keldysh  model,  the  predicted  values  of  the  absorption  coefficients  should 
be  regarded  as  lower  limits  expected  in  the  case  of  very  pure  materials. 

By  inspection  of  Fig. 11,  one  can  estimate  that  the  second-order  absorption 
will  be  significant  at  light  fluxes  above  10^  W cm"^  and  the  third  order  - 
above  10  (J  W cm"  2 for  the  intrinsic  nonlinear  electronic  transition  pro- 
cesses in  direct-bandgap  materials.  Absorption  coefficients  for  several 
Semiconductors  of  this  type  are  listed  in  Tables  5 and  6 for  fixed  laser 
wavelengths  of  doubled  Nd-^-glass  (0.53  pm) , ruby  (0.694  pm),  Nd  -glass 
(1.06  pm),  Nd3+-YAG  (1.318  pm),  HF  (2.8  pm),  DF  (3.9  pm),  CO  (5.3  pm), 
and  CC^  (10.6  pm). 

Since  the  order  of  the  transition  is  the  dominant  factor  in  deter- 
mining the  magnitude  of  the  absorption,  a large  variation  in  the  bandgap 
with  the  temperature  implies  a very  large  temperature  dependence  of  the 
absorption  coefficient.  This  is  demonstrated  in  the  last  entry  of  Table 
6 by  listing  absorption  coefficients  of  PbS  calculated  at  300  K and  0 K. 
Because  the  perturbative  methods  have  been  carried  out  to  the  second  order 
only,  comparison  of  the  second-order  nonlinear  absorption  coefficient  cal- 
culated for  the  three  models  of  sections  4 and  5 is  given  in  Table  7.  For 
material  parameters  used  in  these  calculations  and  comparison  with  the 
available  experimental  data,  the  reader  is  referred  to  Ref.  32.  We  note 
that  the  absorption  coefficient  calculated  from  the  Keldysh  model  generally 
falls  in-between  the  two  perturbation  models  of  Braunstein  and  Basov. 

However,  the  Basov  model  usually  overestimates  the  absorption  coefficient; 
therefore,  the  Keldysh  model  values  yield  currently  the  best  estimate  of 
the  lower  limit  of  the  nonlinear  absorption  coefficients. 

7.  CONCLUSIONS . Comparison  of  theoretical  models  of  Keldysh,  Braun- 
stein, and  Basov  in  calculating  the  nonlinear  absorption  coefficients  for 
direct  band  semiconductors  with  forbidden  energy  bandgaps  between  2.6  to 
0.15  eV  show  that  the  second  order  perturbation  models  of  Braunstein  and 
Basov  differ  in  their  prediction  of  the  by  almost  three  orders  of  mag- 
nitude. The  Keldysh  model  prediction  for  <*2  lies  between  the  two  perturba- 
tion models  and  is  usually  slightly  lower  than  the  experimental  value.  A 
large  scatter  in  reported  experimental  values  of  ctn  (perhaps  attributable 
to  the  presence  of  impurities  in  real  materials)  hinders  in  selecting  the 
best  theoretical  approach.  However,  generally  it  can  be  stated  that  if  the 
interference  terms  in  calculating  transition  rates  are  neglected,  the  absorp- 
tion constants  are  underestimated  (Braunstein  and  Ockman) . Calculations 
allowing  transitions  from  several  bands  (Basov)  generally  overestimate  the 
absorption.  The  Keldysh  method  in  which  the  effect  of  the  optical  electro- 
magnetic field  on  the  eigenfunctions  of  the  unperturbed  system  is  incorporated 
at  the  beginning  of  the  calculation  gives  a slight  underestimate  of  the  two- 
photon  absorption.  It  describes  very  well  the  band  edge  absorption  of  the 
one-photon  process.  Intuitively,  one  would  concede  that  the  perturbation 
of  the  eigenstates  of  the  noninteracting  system  by  the  light  intensities 
of  interest  is  significant  and  that  the  perturbed  eigenfunctions  should  be 
introduced  at  the  onset  of  calculations  as  it  is  done  in  the  Keldysh  model. 

In  addition,  the  Keldysh  model  is  the  only  one  currently  available  to 
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Table  5.  Absorption  coefficients  of  order (1) to (3) calculated  from  the  Keldysh  model  for  direct  bandgap  semicon' 
due tors  of  2.6  to  1.2  eV  bandgap.  Material  parameters  listed  in  [18]  and  [31]  were  used;  m*  denotes  the 
reduced  effective  pair  mass  and  e is  the  high  frequency  dielectric  constant, 
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Table  7,  Comparison  of  calculated  two-photon  absorption  coefficient 
cm  W-*-  for  direct  bandgap  semiconductors.  Absorption  coefficients  were 
calculated  from  eq.  (2.2);  for  transition  rates,  eqs.  (4.10),  (5,1),  and 
(5.12)  were  used  for  Keldysh,  Braunstein,  and  Basov  models,  respectively. 
Contribution  from  only  one  valence  band  is  listed  under  Basov;  the  total 
absorption  coefficient  should  include  transitions  from  both  valence  bands 
and  should  be  nearly  twice  as  large. 


ot2  [cm  W"1] 
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calculate  absorption  coefficients  higher  than  second  order  since  there  has 
been  no  published  work  carrying  perturbation  theory  to  a higher  than  second 
order  in  semiconductors.  However,  because  intensities  above  HP  W cm"^ 
are  required  to  attain  a significant  absorption  in  the  third  order  in  pure 
materials,  such  calculations  may  be  superfluous  for  practical  applications 
because  of  the  onset  of  Drude  absorption  by  free  carriers  created  through 
the  avalanche  multiplication  induced  by  the  optical  electric  field.  In 
fact,  the  notion  that  a single  physical  process  can  explain  the  effects  of 
an  intense  laser  pulse  on  a given  optical  material  is  too  simplistic  to 
hold  because,  for  example,  the  multiphoton  absorption  and  the  avalanche 
ionization  are  two  competing  processes  having  a different  time  dependence. 

On  the  other  hand,  this  contribution  demonstrates  that  it  is  impor- 
tant to  include  properly  the  band  parameters  of  the  solid  state  into  the 
calculation.  In  particular,  if  contributions  from  excitonic  states  and 
impurity  levels  should  be  included,  a summation  over  all  corresponding  ele- 
ments of  the  interaction  Hamiltonian  must  be  incorporated  into  the  calcula- 
tion of  absorption  coefficients.  The  outline  of  such  a procedure  is  given 
in  R|f,  33.  Furthermore,  up  to  the  third  order,  it  is  sufficient  to  use 
the  p * A interaction  term  in  the  electric  dipole  approximation. 
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A TIME -DEPENDENT  QUANTIZED  NATURAL  COLLISION  COORDINATES  METHOD* 


Norman  M.  Witriol 
Physical  Sciences  Directorate 

US  Army  Missile  Research,  Development  and  Engineering  Laboratory 

US  Army  Missile  Command 
Redstone  Arsenal , AL  35809 

ABSTRACT.  In  the  reactive  or  nonreactive  molecular  collisions! 
energy  transfer  problem,  the  quantum  mechanical  natural  collision  coord- 
inate method  has  been  shown  to  be  useful  in  predicting  the  distribution 
of  the  energy  of  the  reactants  into  the  translational  and  vibrational 
energies  of  the  products.  This  method  has  only  been  employed  in  a time- 
independent  formalism.  However,  time-dependent  studies  of  this  problem 
via  the  solution  of  the  time -dependent  Schrodinger  equation  in  the  origi- 
nal coordinates  have  revealed  some  interesting  insights  into  the  dynam- 
ical effects  taking  place.  With  the  goal  of  obtaining  the  advantages 
of  using  natural  collision  coordinates  in  a time-dependent  method,  a 
study  of  time-dependent  quantum  mechanical  canonical  point  transforma- 
tions has  been  initiated.  One  finds  that  by  proceeding  to  the  Klein- 
Gordon  equation,  a similar  formalism  to  that  used  in  handling  time- 
independent  quantum  mechanical  canonical  point  transformations  can  be 
employed.  Applying  this  method,  one  obtains  the  transformed  Hermitian 
coordinate  momenta  and  the  transformed  Hermitian  Hamiltonian.  The 
Klein-Gordon  equation  and  the  transformed  wavefunction , in  the  trans- 
formed space,  are  explicitly  displayed. 


*The  full  text  of  this  paper  will  be  published  elsewhere. 
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COMPUTER  SIMULATION  OF  THE  INTERMEDIATE  BALLISTIC  ENVIRONMENT 

OF  A SMALL  ARM 


Csaba  K.  Zoltani 
Fluid  Mechanics  Branch 
Applied  Mathematics  § Sciences  Laboratory 
USA  Ballistic  Research  Laboratories 
Aberdeen  Proving  Ground,  MD  21005 


ABSTRACT , In  this  paper  the  results  of  computation  of 
the  gas  flow  in  the  presence  of  the  moving  projectile  in  the 
intermediate  ballistic  range  are  presented.  The  mathematical 
model  is  based  on  the  assumptions  of  cylindrical  symmetry  with 
unsteady,  inviscid,  compressible  motion  of  a one  component  gas. 

The  flow  variables  at  the  muzzle  plane  were  varied  continuously 
in  accordance  with  the  prediction  of  an  interior  ballistic 
calculation ! * 

Velocity,  pressure  and  temperature  time  histories,  as  well 
as  pressure  contours  of  the  flow  field  were  computed  up  to  real 
times  of  35ysec.  The  fine  meshing  employed  in  the  difference 
technique  enabled  the  resolution  of  significant  flow  details, 
including  the  backward  facing  shock  at  the  base  of  the  projectile. 
The  calculated  results  are  in  satisfactory  agreement  with  avail- 
able analytical  and  experimental  data. 

1.  INTRODUCTION.  About  lOOysec.  before  the  base  of  the 
projectile  clears  the  plane  of  the  muzzle  of  an  M- 16  rifle,  the 
first  indication  of  gas  motion  out  of  the  gun  tube  becomes  apparent. 
This  is  in  the  form  of  a weak  air  shock,  traveling  at  approximately 
M=1.5  with  the  pressure  rise  of  the  order  of  16  atmospheres. 

About  lOysec.  later,  the  gases  which  leaked  around  the  projectile, 
while  the  latter  is  still  in  the  barrel,  appear  starting  to  form 
the  characteristic  gas  cloud.  First  evidence  of  a vortex  forma- 
tion around  the  lip  of  the  muzzle  is  also  observed  at  this  time. 

Once  the  base  of  the  projectile  clears  the  muzzle,  the  combustion 
products  stream  out  of  the  gun  tube,  and  envelope  the  projectile, 
which  at  a velocity  of  990  [?■]  , is  moving  slower  than  the  gases.' 
Near  the  centerline  of  the  muzzle  the  pressure  is  around  600 
atmospheres  and  temperature  approximately  2500°K.  A backward 
facing  shock  forms  on  the  base  of  the  projectile,  which  as  the 
projectile  accelerates,  gradually  weakens  and  disappears  as  soon 
as  the  powder  bell  is  exited  by  the  bullet. 

The  shock  due  to  the  powder  bell  behaves  in  the  far  field 
like  a point  explosion  with  the  center  of  energy  deposition  located 
about  half  a caliber  in  front  of  the  gun  tube.  The  motion  of 
this  shock  front  is  described  quite  well  by  self-similar  calcula- 
tions. At  about  lOOysec.  after  shot  ejection,  as  the  projectile 
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leaves  the  intermediate  ballistic  range,  the  powder  bell  is 
approximately  17.5  calibers  in  radius  and  22  calibers  in  length, 
measured  downstream  from  the  muzzle.  A typical  shock  Mach  number 
at  this  time  is  around  M ~ 2.0. 

The  presence  of  muzzle  devices  complicates  the  flow  field 
considerably  due  to  shock  refraction  and  wave  interaction.  The 
muzzle  brake  used  in  this  study,  though  an  idealized  version  of 
the  standard  M-16  muzzle  attachment,  exhibits  all  the  salient 
characteristics  of  the  flow  field. 

The  basic  features  of  muzzle  flow  have  been  understood  since 
the  pioneering  researchers  of  Cranz  [1]  in  the  1920's.  Numerical 
calculation  of  the  flow  field  have  been  attempted  only  recently, 
notably  by  Oswatitsch  [2],  Taylor  [3]  and  the  group  at  Dahlgren 
[4].  The  latter  two  are  not  quite  satisfactory,  as  explained 
elsewhere  [5],  which  motivated  the  current  study. 

II.  Mathematical  Analysis  of  the  Flow  Field. 

A.  The  Flow  Model.  The  calculations  are  based  on 
a two-dimensional  time  dependent  flow  model.  The  geometry  of 
the  flow  field  (and  of  the  muzzle  devices)  is  assumed  to  be 
circular  symmetric  with  respect  to  the  axis  of  the  gun  barrel. 

The  flowing  medium  is  assumed  to  be  a one-component,  inviscid, 
non-reacting  gas. 

At  the  beginning  of  the  calculations  the  projectile 
is  placed  1.5mm  (i.e.,  three  mesh  widths)  in  front  of  the  muzzle 
and  assumed  to  have  the  "muzzle  velocity"  of  990m/s. 

The  initial  conditions  for  the  gas  are  ambient  pres- 
sure (1  atm)  and  temperature  ( 3 0 OK) , and  zero  velocity  everywhere, 
except  between  exit  plane  of  the  muzzle  and  the  projectile.  At 
that  location  we  assume  high  enthalpy  and  sonic  (choked)  flow 
conditions,  constant  across  the  muzzle  opening.  (The  initial  gas 
velocity  is,  e.g.,  1210  m/s).  The  variation  of  these  conditions 
at  the  exit  plane  with  time  are  taken  from  one-dimensional  interior 
ballistics  calculations  [6]  . Because  the  exit  flow  is  sonic,  such 
calculations  are  independent  of  the  intermediate  ballistics  events 
and  constitute  the  time  dependent  boundary  condition  at  the  muzzle 
for  the  present  calculations. 

The  projectile's  acceleration  during  the  intermediate 
ballistics  phase  is  computed  from  pressure  forces  in  accordance 
with  Newton's  law  of  motion.  Viscous  drag  forces  are  neglected 
because  the  working  medium  is  assumed  to  be  inviscid  for  the 
present  model. 

A schematic  of  the  computing  mesh  and  initial  conditions 
is  shown  in  Figure  1. 

The  governing  equations  for  the  gas  are  in  cylindrical 
coordinates  are  as  follows: 
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E is  the  specific  total  energy  of  the  fluid,  defined  by 
E = 1/2  (u^  + v^)  + I where  I is  the  specific  internal  energy. 


Equations (1-4) , the  basis  of  our  analysis,  represent  the 
conservation  of  mass,  momentum  in  the  two  co-ordinate  directions, 
and  energy  respectively.  The  Noble-Abel  equation  of  state,  valid 
for  most  propellant  reactions  products,  makes  the  above  system 
determinate.  It  is 
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where  0 is  the  co- volume,  for  the  calculation  at  hand  having 

m3 

a value  of  0.001  [^7],  and  is  the  mole  mass  of  the  gas  with 
R the  universal  gas  constant.  The  temperature  is  calculated  from: 


x = m 

R 

with  M the  molecular  weight  of  the  gas  and  I is  the  specific 
internal  energy. 

B.  The  Boundary  Conditions.  The  boundaries  were 
handled  in  the  following  manner:  along  the  axis  of  symmetry,  re- 

flection was  used,  i.e.  a row  of  imaginary  cells  was  defined  in 
which  all  the  quantities  have  the  same  values  as  in  the  last 
row  of  cells  in  the  flow  field  except  for  the  normal  component 
of  the  velocity,  the  sign  of  which  reversed  in  the  imaginary  cell. 
The  same  technique  was  used  along  all  the  solid  boundaries,  that 
is, along  the  muzzle  walls,  the  projectile  and  the  muzzle  device. 

Outflow  along  the  free  boundaries  had  to  be  guaranteed. 
To  prevent  wave  reflection,  which  was  observed  as  the  shock  inter- 
sected the  last  computational  cell,  a new  method  of  handling  the 
free  boundaries  was  introduced,  see  Figure  2.  In  essence,  it  in- 
volves subtracting  out  mass  in  the  last  three  cells,  ensuring  that 
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a negative  pressure  gradient  exists  there  facilitating  the  out- 
flow. 


The  conditions  at  the  muzzle  were  specified  by  an 
interior  ballistics  calculation  [6] . For  the  first  SOys  after  the 
projectile's  ejection  they  were  as  follows: 

u = 1210  - 1.340  x t 
v = 0.000 

p • (486.0  - 0.561  x t)  x 105 
p = 40.95  - 0.092  x t 

where  t is  specified  in  microseconds. 

C.  The  Finite  Difference  Technique.  The  calculation 
proceeds  in  two  steps.  The  first  step  is  pure  Lagrangian  in  that 
the  convective  terms  are  dropped  from  the  equations  of  motions. 

The  pressure  gradients  are  approximated  by  leapfrog  differencing 
with  the  pressure  of  the  cell  boundary  being  an  average  of  the 
two  neighboring  cells  in  each  co-ordinate  direction.  The  result 
of  the  first  step  is  a set  of  intermediate  quantities,  no  physical 
significance  being  abscribed  to  their  values.  It  should  be  noted 
that  in  the  energy  equation,  the  velocities  used  are  averages  of 
the  initial  and  the  intermediate  values  so  that  in  fact  I-,  the 
intermediate  energy  of  the  flow  at  location  j,  uses  a different 
velocity  base  than  the  momentum  equation.  Although  this  makes 
the  scheme  in  a sense  inconsistent,  the  original  developers  of  the 
algorithm  [7]  claimed  that  this  procedure  was  necessary  to  ensure 
energy  conservation. 

Phase  two  of  the  algorithm  updates  the  flow  quantities 
to  the  final  time  by  taking  account  of  the  transport  terms  neglected 
in  phase  one.  The  continuity  equation  uses  donor  cell  differenc- 
ing, i.e.  the  mass  flow  across  the  side  of  a cell  is  calculated 
by  taking  the  density  of  the  donor  cell  and  the  velocity  is  a 
weighted  average  of  the  velocity  of  the  material  flowing  from  the 
donor  and  the  recipient  cell.  This  velocity  is  obtained  by  a 
Taylor  series  expansion  about  the  cell  boundary.  Analogous  pro- 
cedures are  followed  for  the  momentum  and  the  energy  equations. 

The  time  step  was  determined  by  the  usual  CFL,  region  of  influence 
criterion  with  a factor  0.25,  i.e. 

At  < o.25  ■inQr.At) 
a+/u2+v2 

where  a is  the  local  sound  speed.  The  calculations  were  carried 
out  on  a non-uniform  mesh  with  150  cells  in  the  axial  direction  and  60 
cells  in  the  radial  direction.  Computational  experience  showed 
that  cells  could  not  have  an  aspect  ratio,  L/D  greater  than  1.5 
and  maintain  stability,  and  in  addition,  cells  had  to  be  closely 
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spaced  near  the  plane  of  muzzle,  otherwise  wild  oscillations 
were  observed.  In  the  axial  direction  then,  the  first  three 
cells,  containing  high  pressure  gases  were  spaced  at  5.0  x 10"4[m] 
It  was  at  this  distance  that  the  base  of  the  projectile  was 
put  at  time  t=0.  Dowstream  from  that  location,  the  cell  size 
was  gradually  increased  to  1,0  x 10-^[m]  , then  maintained  constant 
further  downstream. 

Typical  time  steps,  which  of  course  are  a function 
of  mesh  spacing,  were  around:  10”^sec.  Running  times  for  35ysec 
real  time  were  of  the  order  of  4 hours  on  BRLSC  II. 

Ill,  Results  and  Discussion.  The  code  employed  in  these 
calculations  is  a revised  version  of  DORF [ 8 ] , modified  to  in- 
clude moving  boundaries  [9] . Considerable  improvement  of  [9] 
was  necessary,  however,  before  acceptable  results  were  obtained. 

Figure  3 and  4 show  pressure  contours  of  the  flow  field 
at  early  times.  The  isobars,  proceeding  from  the  outermost 
inward,  are  set  at  0.1215,  0.2,  0.4,  0,8,  1.0,  5.0,  10.0, 

20.0,  50.0  [MPa]  respectively.  A shock  is  indicated  where 
these  are  closely  spaced,  all  around  the  periphery  of  the  gas 
cloud.  Due  to  the  inherent  diffusion  of  the  finite  difference 
technique,  the  shock  is  smeared,  being  0.003  [m]  in  thickness. 

At  t ■ 15.41ysec.  (Figure  4)  the  interaction  with  the  muzzle 
device  is  clearly  evident  as  well  as  the  gas  motion  around 
the  tip  of  the  projectile.  The  pressure  at  the  shock  front 
is  typically  6 atmospheres.  The  next  graph,  Figure  5,  shows 
the  pressure  along  the  stagnation  streamline  (i.e.  axis  of  the 
barrel]  for  cycles  100,  200,  300  corresponding  to  real  times 
of  5.50,  10.58,  14.41  [ysec] . At  cycle  100  the  pressure  drops 
from  4 x 10'  [Pa]  at  the  nozzle  to  around  9.0  x 10^  [Pa]  toward 
the  base  of  the  projectile.  A slight  rise  at  the  base  of  the 
projectile  indicates  the  presence  of  a backward  facing  shock. 

This  shock  comes  about  because  the  gases  are  still  moving  faster 
than  the  projectile,  the  base  of  which  appears  like  a blunt 
body  to  the  flow.  The  projectile  pushes  against  the  ambient 
at  its  tip  producing  the  lump  in  the  gas  pressure  there  corres- 
ponding to  a bow  shock.  At  succeeding  cycles  the  results  are 
qualitatively  analogous,  with  the  pressure  continuously  dropping 
at  the  base  of  the  projectile  due  to  the  lateral  expansion  of 
the  gas.  Figure  6 shows  the  radial  pressure  profile  in  a plane 
at  0.0015 [m]  in  front  of  the  muzzle  at  t=15.41ysec  proceeding 
in  the  radial  direction.  The  profile  shows  the  expected  steep 
pressure  drop  in  the  direction  away  from  the  muzzle,  followed 
by  a rise  to  around  5.0  x 10^  [Pa]  at  the  blast  wave. 

In  Figure  7 we  have  plotted  the  radial  temperature  pro- 
file at  t = 34.29ysec.  in  the  plane  1.5  mm  in  front  of  the 
muzzle.  The  interesting  point  here  is  the  temperature  rise, 
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_ 3 

at  a distance  of  34  x 10  [m]  from  the  axis  of  symmetry, 

indicating  the  presence  of  the  shock. 

The  next  figure,  Figure  8,  shows  the  axial  velocity 
along  the  axis  of  symmetry  at  three  different  times.  The  gas 
velocity  increases  downstream  from  the  muzzle  as  the  gases 
expand,  reaching  a value  of  slightly  in  excess  of  2500 [m/s] 
before  steeply  dropping  at  the  base  of  the  projectile.  Near 
the  nose  of  the  projectile,  one  can  observe  a steep  drop  to 
zero,  the  gases  having  been  only  slightly  affected  by  the  mo- 
tion of  the  projectile  at  these  early  times. 

A number  of  checks  were  made  on  the  results  described. 
These  consisted  of  Rankie-Hugoniot  conditions  in  the  normal 
and  radial  directions  to  see  if  the  conservation  laws  were 
obtained.  Deviations  of  up  to  25%  were  observed,  but  in  view 
of  the  fact  that  the  RH  conditions  pertain  to  steady  flow, 
while  here  we  one  dealing  with  an  unsteady  situation,  the  agree- 
ment was  judged  to  be  satisfactory. 

The  shock  stand-off  distance  on  the  base  of  the  projectile 
was  also  calculated.  Using  the  correlation  of  reference  [10] , 


6 _ 1.03 

R " 


(6) 


where  <5  is  the  stand-off  distance,  R the  characteristic  dimension 
and  px  the  density  of  the  gas  ahead  of  the  shoe!;  agreement 
within  10%  of  that  predicted  by  formula  (6)  was  found. 


Finally,  the  calculated  pressure  was  checked  against 
experimental  values.  Measurements  are  considered  to  be  unreliable 
due  to  the  hostile  environment  of  the  flow  field  and  the  fact  that 
the  insertion  of  a probe  into  the  flow  field  will  alter  its 
structure.  Figure  9 shows  the  correlation  between  theory  and 
experiment . 


In  conclusion  then,  we  have  presented  the  results  of 
the  simulation  of  the  flow  field,  for  early  times  at  the  muzzle 
of  a small  caliber  weapon.  The  numerical  results  are  in 
satisfactory  agreement  with  our  qualitative  understanding  of 
the  flow. 
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Figure  1,  Schematic  of  the  Computational  Region  at  t = 0.000 
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Figure  2.  The  Scheme  for  the  Outflow  Boundaries, 
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Figure  3.  Flow  Field  With  Moving  Projectile  Showing  Lines  of  Constant  Pressure, 
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Figure  4,  The  Flow  Field  at  t = 15,41  msec, 
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Figure  5.  Pressure  along  the  Stagnation  Streamline,  The  Cycle  Number  are  100,  200 
and  300  corresponding  to  t = 5,50,  10.58  and  15.41  ns. 
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Figure  6, 


Pressure  Radially  in  a Plane  at  0,0015 (m)  from  the  Muzzle, 
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Figure  7.  Temperature  Radially  in  a Plane  at  0,0015(m)  in  Front  of  the  Muzzle, 
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Figure  8,  The  Horizontal  Velocity  Along  the  Axis  of  Symmetry  at  Cycles  100,  200, 
300,  corresponding  to  t = 5,50;  10,50;  and  15.41  as. 
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Figure  9,  Comparison  Between  Computed  and  Measured  Pressure  Values, 


GENERALIZED  SHOCK  WAVE  PHYSICS: 
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ABSTRACT  The  steady  shock  velocities  for  finite  amplitude  electro- 
magentic  and  second  sound  disturbances  are  derived.  A generalized  form 
for  the  shock  velocity  of  an  arbitrary  disturbance,  in  terms  of  "jump" 
conditions  in  stimulus  and  response  parameters,  is  suggested.  That 
generalized  form  is  compared  with  the  derived  shock  velocities,  and  with 
the  shock  velocity  for  a finite  amplitude  mass  density  disturbance. 

I.  INTRODUCTION 

A steady  state  finite  amplitude  compressional  mass  density  disturbance 
is  characterized  by  a shock  velocity,  u » given  by 

(1)  U = 

where  P dentoes  pressure,  p is  mass  density,  the  subscripts  i and  f de- 
note initial  (unshocked)  and  final  states  respectively,  and  the  curly 
brackets  indicate  that  the  quantity  is  to  be  evaluated  across  the  shock 
front  (surface) . 

{P}  - P.p  - P. 

f 1 

The  form  of  Eq.  (1)  has  been  known  for  more  than  a century^. 

In  sections  II  and  III  we  will  derive  expressions  for  the  shock 
velocity  associated  with  finite  amplitude  electromagnetic  and  second 
sound  disturbances.  Those  expressions  will  be  seen  to  be  of  a form 
similar  to  Eq.  (t) , and  lead  us  to  suggest  that  a generalized  finite 
amplitude  disturbance  propagates  at  a shock  velocity  given  by 

(2) 
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where  A.  denotes  the  variable  assoicated  with  the  applied  ith  stimulus, 
B.  the  variable  characterizing  the  corresponding  response,  and  <f>  is  a 
quantity  assoicated  with  dimensionality. 

IX.  ELECTROMAGNETIC  SHOCK  WAVES 


Let  us  consider  the  two  time-varying  Maxwell's  equations  for  elec- 
tromagnetic field  propagation  in  a medium. 


(3) 
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where  gaussian  units  have  been  used,  cQ  is  the  velocity  of  light  in  vacu- 
um, and  the  symbols  have  their  usual  meaning 


We  restrict  ourselves  to  spatial  variation  only  in  the  (or  z) 
direction.  This  corresponds  to  the  so  called  "one- dimensional"  strain 
configuration  of  mass  density  shock  wave  problems.  Thus 
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The  steady  shock  can  be  described  by  employing 
in  Eqs.  (5)  and  (6),  The  second  of  Eqs.  (5)  and  the  first  of  Eqs^  (6) 
then  give 


(7) 

(8) 
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where  dependence  on  only  a single  independent  variable  allows  total 
differentials  to  be  used* 

Integrating  Eqs.  (7)  and  (8)  across  the  shock  front,  which  may  be 
of  finite  thickness,  and  solving  for  U , gives 


(9) 


U = 
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{D1HB2} 
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For  the  special  case  = 0,  Eq.  (9)  is  the  form  of  Eq.  (2); 

E-^  and  H2  are  identified  as  stimuli,  and  D-^  and  the  corresponding 
response  variables.  The  integral  in  Eq.  (9)  is  to  be  taken  across 
the  shock  front.  Physically,  J-j_  enters  as  it  does  because  a surrounding 
magnetic  field  is  always  associated  with  a current,  Eq.  (9)  is  well 
known.  Indeed,  it  is  the  basis  for  important  effects  in  pulse  shaping 
and  transmission  line  technology^. 

The  existence  of  a shock  wave  is  generally  associated  with  non- 
linear material  properties.  Although  we  have  not  explicitely  exhibited 
such  nonlinearities  in  arriving  at  Eq,  (9),  they  are  in  principal 
present.  For  example,  can  have  quadratic  dependence  on  E^,  and/or 
field  dependent  breakdown  can  contribute  to 

In  actual  practice  the  existence  of  nonlinear  material  properties 
is  a necessary  but  not  a sufficient  condition  for  the  existence  of  a 
shock  wave.  In  order  for  an  electromagnetic  shock  to  exist,  it  is 
necessary  that  the  nonlinearity  be  such  that  IJ  increases  as  the  magni- 
tude of  and/or  increases.  Suffice  it  to  say  that  systems  do  exist 
which  have  nonlinearities  of  the  type  necessary  for  the  formation  of 
electromagnetic  shock  waves^. 

Regardless  of  whether  or  not  the  shock  condition  exists,  Eq,  (9) 
gives  the  velocity  of  that  finite  amplitude  disturbance  being  propagated. 
As  it  must,  Eq,  (9)  yields U = cq  for  propagation  in  a vacuum, 

III.  SECOND  SOUND  SHOCK  WAVES 


Second  sound  is  typically  a low  temperature  phenomenon.  It  is  de- 
fined as  the  coherent  propagation  of  a thermal  disturbance.  In  pure 
second  sound  there  is  no  associated  mass  density  (or  pressure)  disturbance. 
The  more  everyday  (say  room  temperature)  mode  of  thermal  propagation  is 
via  incoherent  diffusive  processes;  second  sound  conditions  are  achieved 
when  the  frequency  content  of  the  thermal  disturbance  is  of  the  order 
of  the  reciprocal  of  the  thermal  relaxation  time  for  the  system  in 
question^ > 

Second  sound  was  first  observed ^ in  liquid  helium  (helium  II),  and 
has  since  been  observed  in  non- super flu id  materials^.  Finite  amplitude 
second  sound  can  give  rise  to  second  sound  shock  which  has  been  obser** 
ved^>9  ±n  liquid  helium.  In  this  paper  we  will  derive  the  velocity  of 
a steady  second  sound  shock  in  liquid  helium. 

Within  the  context  of  the  two  fluid  model  for  superfluidity  the 
equations^  for  the  conservation  of  mass,  the  conservation  of  momentum, 
and  the  conservation  of  entropy,  become 
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In  Eqs.  (1)  - (12)  the  physical  parameter  s denotes  specific  entropy, 
T is  temperature,  v is  the  particle  (flow)  velicity,  and  the  subscripts 
s and  n refer  to  the  superfluid  and  normal  components  respectively. 

P3  and  PR  are  constrained  by  P " ps  pn  5 and  on^y  normal  com- 
ponent  is  a carrier  of  entropy.  Second  sound  gives  the  additional  con- 
straint 


(13)  dp  = dp  = 0 . 

As  in  the  electromagnetic  case  we  employ 
Thus  from  Eq.  (11a) 
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Integrating  Eq. 
(15) 


(14)  with  d,P  = 0 gives 
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Substituting  the  steady  state  shock  condition  into  Eq . (12)  gives 


(16a) 

(16b) 
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Combining  Eqs.  (14),  (15),  and  (l6b)  yields 
(17)  | U - ±-  I I — [sdT  |ds  - r ^ 
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with  the  final  result 


(18) 
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While  use  of  Eq,  (18)  is  limited  iij^that  Eqs,  (11)  represent  only 
the  linearized  version  of  superfluidity  , it  is  important  to  realize 
that  the  same  procedure  utilized  here  is  applicable  to  the  nonlinear 
case;  the  result  would  be  a more  complicated  version  of  Eq.  (18), 


To  begin  comparing  Eqs-  (2)  and  (18)  we  neglect  the  inner  integral 
in  the  denominator  if  Eq.  (18),  and  we  average  the  integrand  in  the  num- 
erator. The  result  should  apply  to  finite  though  small  amplitude  shocks 
and  is  given  by 
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Upon  identification  - (PsPn  s ^avg  we  have  the  desired  form 

of  Eq  * (2),  T is  recognized  as  the  stimulus,  s the  response,  and  second 
sound  is  to  be  viewed  as  the  propagation  of  a coherent  entropy  disturbance. 


In  the  limit  of  an  infinitesimal  disturbance  Eq.  (19)  reduces  to 
the  accepted^  expression  for  second  sound  velocity, 
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where  is  the  specific  heat  at  constant  pressure. 

The  next  degree  of  analysis  of  Eq.  (18)  is  to  consider  the  integral 
in  the  numerator.  This  represents  a situation  similar  to  the  integral 
over  the  current  density  in  Eq.  (9),  and  is  present  because  $ is  a 
strong  function  of  the  stimulus  T.  Physics  and  dimensional  analysis 
would  presumably  have  allowed  one  to  use  Eq.  (2)  in  order  to  guess  at 
the  form  of  the  numerator  in  Eq.  (18).  The  form  of  the  inner  integral 
in  the  denominator  of  Eq.  (18)  is  somewhat  less  obvious. 

Again  not  all  nonlineatities  are  sufficient  to  cause  a positive 
(AT)  disturbance  to  shock  up.  The  existence  of  such  a shock  depends 
upon  the  detailed  behavior  of  (p^p^s2)  as  a function  of  T. 

By  using  the  pure  second  sound  conditions  of  Eq.  (13),  we  have  ne~ 
glected  the  higher  order  physics  associated  with  thermal  expansion  and 
the  Gruneisen  parameter.  The  usual  literature^  derivation  of  the 
second  sound  shock  velocity  is  perturbative  (second  order  in  A T)  and 
includes  the  mentioned  higher  order  effects.  The  form  of  the  second 
sound  shock  velocity  thus  arrived  at  is 


(21) 
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The  higher  order  effects  can  be  included  in  the  approach  leading  to 
Eq.  (18)  by  retaining  the  terms  in  do  and  dP.  It  is  interesting  to 
note  that  Eq.  (19)  can  be  placed  in  a form  similar  to  Eq.  (21),  al- 
though without  the  corresponding  physical  content,  by  using 


(22) 
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IV.  DISCUSSION 

We  have  shown  that  an  intelligent  use  of  Eq.  (2)  is  a good  starting 
point  for  guessing  at  the  shock  velocity  of  an  arbitrary  disturbance  in 
a complicated  system.  If  we  could  not  have  guessed  at  the  denominator 
of  Eq.  (18),  we  could  certainly  have  written  an  empirical  form  for{B) 
of  Eq.  (2)  which  might  have  suggested  the  correct  solution. 


Among  the  more  esoteric  nonlinear  systems  to  w^ich  one  might  wish 
to  apply  Eq*  (2)  are  those  of  quantum  field  theory^  (propagation  of  a 
probability  amplitude) , and  magnetohydrodynamic  (MHD)  shocks  in  gases^ 
and  solids-^.  A propagating  distrubance  in  a nonlinear  medium  can,  in 
general,  give  rise  to  shock  formation,  and  its  study  may  be  of  interest 
for  a wide  variety  of  physical,  biological  or  social  systems. 
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ON  RIEMANN'S  INVARIANT  AND  SHOCK  IMPEDANCE  OF  SOLIDS 


Y,  K.  Huang 

Benet  Weapons  Laboratory 
Watervl iet  Arsenal 
Watervliet,  New  York  12189 


ABSTRACT.  Using  the  Riemann  invariant  and  Rankine-Hugoniot 
jump  conditions,  analysis  is  made  to  show  the  basic  difference 
and  interrelationship  between  acoustic  and  shock  impedances  of 
solids.  The  investigation  also  arrives  at  a new  formulation  for 
the  general  equation  of  shock  and  particle  velocities  in  both 
binomial  and  polynomial  forms.  The  nonlinear  binomial  should 
be  of  special  interest  to  those  who  used  to  work  on  shock  waves 
in  solids. 

1.  INTRODUCTION . The  Riemann  invariant  is  well  known  in 
connection  with  the  hyperbol i c-type  partial  differential  equations 
and  with  the  transient  analysis  of  nonlinear  waves  or  supersonic 
flows.  Thus,  the  method  of  characteristics  can  aid  considerably 
the  investigation  of  such  relevant  problems  as  shot  propulsion 
or  gun  firing  (internal  ballistics),  muzzle  and  recoil! ess-ri fl e 
blasts  (transition  or  Intermediate  ballistics),  and  hypervelocity 
impact  on  armor  (terminal  ballistics).  This  paper  Is  concerned 
with  a basic  analysis  regarding  the  terminal  ballistic  effects 
in  solids.  For  a given  compression  of  solids,  we  can  put  its 
Isentrope  and  shock  adiabat  in  one-to-one  correspondence  by  using 
the  Mi e-Grunei sen  equation  of  state  in  coupled  form.  Such  con- 
sideration turns  out  to  yield  a number  of  useful  results  and  inter- 
relationships between  the  acoustic  and  shock-wave  properties  of 
solids1*2.  From  yet  another  point  of  view  we  can  determine  a 
sound  wave  which  corresponds  to  a shock  wave  with  the  same  amount 
of  compression.  This  calls  for  a transformation  from  the  pv-plane 
to  the  pu-plane.  Now  each  wave  is  associated  with  an  impedance  of 
its  own  just  as  it  has  Its  own  pressure  amplitude.  Some. results 
from  this  approach  have  been  discussed  In  a recent  paper1*,  and 
further  results  are  given  In  this  paper  for  the  higher-order  effects 
and  interrelationship  between  the  two  Impedances.  It  may  be  noted, 
that  the  Griineisen  parameter  provides  an  Interlink  in  the  pv-plane2 
and  that  the  Riemann  invariant  is  the  similar  tool  in  the  pu-plane4. 
From  Riemann's  invariant  we  can  deduce  the  acoustic  impedance  and 
Its  higher-order  effects.  Likewise,  we  can  evaluate  the  shock 
impedance  using  Hugoniot's  momentum  equation.  Interrelations 
between  the  two  impedances  are  best  demonstrated  by  consideration 
of  a third-order  representation,  which  turns  out  to  yield  the  gen- 
eralized relation  between  the  shock  and  particle  velocities  as  has 
been  used  without  proof  in  Reference  1,  Now  a complete  derivation 
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05’) 


(dp$/du)0  * (dpH/du)0 

(d2ps/du2 )0  * (d2pH/du2)0  (16') 

(d3ps/du3)o  = (d3pH/du3)0  + l/2(dp^/du)”3 (d2pH/du2)0  x 

[1/2(d2pH/du2)o  - G0]  07') 


Thus,  the  Isentrope  coincides  initially  with  the  shock  adiabat 
in  both  pv-  and  pu-plane.  The  two  adiabatic  curves  separate 
from  each  other  on  the  third  order  as  described  by  Eqs.  (17) 
and  (17').  These  interrelations  are  very  useful  in  this 
Investigation.  The  following  are  all  straightforward 
deduction . 

From  Eqs.  (l)-(3),  we  may  write 
Z ■ Z0  + ZQu  + l/2Zp'u2  + • ’ ' 

* (dp$/du)0+(d2ps/du2)0u  + l/2(d3ps/du3)Qu2  + ••• 


Let  us  rearrange  Eq.  (4)  to  denote  the  shock  Impedance  by  Y * 

U/Vq  » pH/u  In  analogy  to  Eqs.  (1)  and  (19).  Then  we  get 

Y " Y.  + Y'u  + 1/2Y''u2  + • - • 

00  0 (20) 

* (dp^/du)Q  + l/2(d2pH/du2)0u+l/6(d3pH/du3)0u2+'  • ■ 

It  Is  Interesting  to  note 

Y0  ■ (dpH/du )0  ■ (dps/du)0  * Z0  (21) 

Vj  ■ 1/2(d2pH/du2)0  ■ l/2(d2ps/du2)q  • 1/2Z„  (22) 

\ ■ l/3(d3pH/du3)0  . l/3{(d3ps/du3)0-l/2(dps/du)'1 

(d2ps/du2)o  x [l/2(d2p5/du2)0-G0]}  - 

1/3  [Z„ ' - 1/2Zg(1/2Zg  - G0)/Z0]  (23) 

by  use  of  Eqs.  (1 )-(4) , (15*  )-(17*  )*  (19),  and  (20).  Substituting 
Eqs,  (21)-(23)  and  (19)  in  Eq,  (20),  we  get 
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Y a Z - 1 /2[Z ' u+(2/3Z ' ' + J-  Z_1z'2  - ~ G^Z^zju2]  (24) 

which  provides  an  interlink  between  the  two  compression  impe- 
dances. Clearly,  Eq.  (24)  is  accurate  to  the  third  order  of 
the  adiabats.  It  is  of  particular  interest  to  note  that  Eq. 

(24)  is  essentially  the  same  as 

u - a0  + 1/2Z„V0U  + 1/6[Zq ' + 1/2Z^(G0  - 1 / 2Z^ > / ZQ ] vQu 2 
• a0  + a.u  - a2u2  (25) 

with  UQ  = aQ  (or  YQ  = Z0).  We  shall  consider  Eq.  (25)  furthere 
in  detail  shortly. 

3.  WEAK  NONLINEARITY.  Thus  far,  we  have  expressed  the 
impedances  in  polynomials.  See  Eqs.  (19)  and  (20),  It  Is 
still  more  illuminating  to  use  the  compact  form:5 

Z = Z0(l  + ps/B)N/2  (26) 

with  constants 

n * u;  + d2/[k;(k;+i)  - uz) 

S * (k'  + 1)K0/[k;<k;  + 1)  - KqKq ' ] . (28) 

Here  KQ,  Kq  and  K ’ are  the  initial  values  of  K = -vdps/dv,  K*  = 
dK/dps  and  K*  1 * cl^K/dp2  at  ps  = 0,  respectively.  From  Eqs.  (1) 
and  (25)-(28),  we  get 

ao  = vozo  * (v0^o)1/2  <29> 

a,  = 1/2v0Z„  - Nv0Z2/4B  = ( + 1 )/4  (30) 

a2  = <-v0/6)[zj'  + l/2Z^(G0-l/2Zj)/Z0]  = (-v0/6)  * 
[1/2(N/B)2  x (1 -N-1 )z|  + (NZ0/4B)(G0-NZ2/4B)] 

■ TT  (v0K0)-1/2^  (<  + ')(^-4y0-7)-K0k;'].  (31) 
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(32) 


Now  Eq.  (31)  may  be  rearranged  as 

-KoKo'  * 2a1(y0-a1+2)  + 12aQa2 

which  verifies  the  explicit  results  of  Reference  1 using  the 
Slater  and  Dugdal e-MacDonal d formulas.  Here  Eqs.  (29)-(31) 
complete  the  formulation  of  Eq.  (25)  which  was  used  without 
proof  in  our  earlier  paper.  It  should  be  noted  that  the 
parabolic  Eq.  (25)  may  become  useless  well  before  it  yields 
U<u.  For  this  reason  and  by  analogy  with  Eq.  (26),  let  us 
consider  the  nonlinear  binomial: 


U = UQ(1  + u/u))n  (33) 

with  0<n<l . From  Eqs.  (25),  (29)-(31),  and  (33),  we  get 

% 

n = a?/ (a?  + Za^,,) 

- 3/4(K^+1 )2/C(K^+1 )(k^-y0-1 )-2K0K„' ] (34) 

W = a0ai / (a?  + 2a0a2) 

■ 3(k;  + i)(v0k0)1/2/[(k;+i)(k;-y0-d-2k0k;'].  os) 


Now  Eq.  (33)  is  not  only  in  closed  form,  but  it  is  more  general 
and  more  accurate  than  Eq.  (25)  provided  that  adequate  data  have 
been  fitted  in  Eqs.  (34)  and  (35)  to  determine  n and  to.  Thus, 
for  a2  = al  = 0 we  get  n = 0 and  U = UQ  = aQ  (the  acoustic 
approximation  of  very  weak  shocks).  For  a2  = 0 only,  we  get 

n = 1 , to  = aQ/a-],  and  U = aQ  + a-jU  (the  widely-used,  linear 

binomial).  Elsewhere^  we  have  shown  this  exactly  by  putting 
N = 1 in  Eq.  (26).  Clearly,  Eq.  (33)  can  be  expanded  to 
yield  the  truncated  form  of  Eq.  (25)  and 

PH(e)  = ea^v*1  m-a^)2  + 2a0a2e2]-1  (36) 

as  given  in  Reference  1 with  e = 1 - v/vQ  = u/U.  If  we  combine 

Eqs.  (4)  and  (33)  with  e,  we  can  determine  a nonlinear  adiabat 
Pr  = PHU)  which  should  be  more  accurate  than  the  truncated 
form  of  Eq.  (36). 

4.  CONCLUSION.  In  this  investigation  we  consider  the 
essential  behavior  and  properties  of  shock  waves  in  solids  from 
a semi -analyti cal  approach.  Our  new  formulation  of  the  velocity 
relation  is  not  only  simple  in  expression  but  also  general, 
including  such  special  cases  as  the  binomial  (linear)  and 
polynomial  (parabolic)  forms  which  are  being  used  widely  in 
the  1 i terature . 
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FREQUENCY  DEPENDENT  WAVE  ARRIVAL  TIME  DELAYS  IN 
DISPERSIVE  AND  NONDISPERSIVE  MEDIA 
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U.  S.  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  Mississippi  39180 


ABSTRACT.  This  paper  presents  the  results  of  the  application 
of  a technique  for  estimating  seismic  wave  arrival  time  delays  in  dispersive 
media.  The  technique  addresses  the  physical  situation  consisting  of 
two  spatially  separated  sensors  (seismic)  implanted  in  a dispersive 
media  for  which  the  frequency-dependent  time  delays  between  the  respective 
signatures  are  desired.  The  time  delay  at  a given  frequency  is  expressed 
in  terms  of  the  distance  between  the  sensors  and  the  frequency-dependent 
velocity  of  propagation.  Initially,  the  phase  differences  between 
the  two  signatures  are  estimated.  The  accuracy  of  the  estimated  phase 
difference  at  each  frequency  is  then  associated  with  the  coherence 
estimate.  Phase  differences  for  which  the  magnitude  of  the  coherence 
is  sufficiently  large  are  used  to  estimate  the  time  delays.  Results 
of  application  of  the  technique  are  presented,  and  problem  areas  are 
discussed . 


1.  INTRODUCTION.  Time  delays  between  two  measurements  of  wave 
motion  in  dispersive  and  nondispersive  media  play  an  important  role 
in  several  fields  of  endeavor.  Generally,  time  delays  between  two 
(or  more)  signatures  are  desired  when  the  signatures  are  obtained  from 
spatially  separated  sensors  receiving  energy  from  a common  source  as 
illustrated  in  Figure  1.  Such  physical  situations  may  occur  in  such 
fields  as  oceanography  (i.e.  measured  time  delays  between  signatures 
of  wave  motion  at  two  locations  are  used  either  to  predict  wave  motion 
at  a given  location  given  the  measured  signature  at  another  or  to  establish 
the  degree  of  correlation  between  events  at  the  two  locations),  seis- 
mology (i.e.  measured  time  delays  are  inserted  into  bearing  location 
algorithms  to  locate  earth  tremors  and  explosions),  or  acoustics  (i.e. 
based  upon  measured  time  delays,  velocities  of  propagation  of  energy 
through  various  media  are  determined) . 

The  purpose  of  this  paper  is  to  review  the  conventional  approach 
for  making  frequency  dependent  estimates  of  time  delays*  using  the  fast 
Fourier  transform  (FFT) ,**  to  illustrate  the  results  of  application  of 
that  approach  to  particular  types  of  signatures  (seismic  signatures  of 
military  vehicles),  to  point  out  its  limitations  and  to  discuss  a method 
for  avoiding  some  of  those  limitations. 


* R.  K*  Otnes  and  L.  Enochson,  Digital  Time  Series  Analysis,  John  Wiley, 
New  York,  1972. 

**  J.  W.  Cooley  and  J.  W.  Tukey,  "An  Algorithm  for  the  Machine  Calcula- 
tion of  Complex  Fourier  Series,"  Mathematics  of  Computation,  Vol  19, 
1965,  p 297. 
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2 . THEORETICAL  CONSIDERATIONS . The  physical  situation  addressed 
in  this  analysis  consists  of  two  sensors  spatially  separated  by  some 
medium  (dispersive  or  nondispersive) . If  the  energy  source  is  at  a 
sufficient  distance  from  the  two  sensors,  the  incoming  waves  can  be 
described  by  plane  wave  fronts  (Figure  1) . The  line  between  these 
sensors  is  chosen  to  define  the  z axis.  The  energy  is  assumed  to  be 
propagating  in  the  positive  z direction  (away  from  the  source) . The 
particle  displacement  for  the  selected  mode  of  propagation  is  assumed  to 
be  in  one  direction,  although  that  direction  is  arbitrary  (i.e.  depending 
upon  whether  or  not  the  vibration  is  compressional,  transverse,  etc.).* 
The  assumption  of  a single  mode  of  propagation  is  employed  to  avoid  the 
ambiguity  that  may  occur  in  estimating  time  delays  when  several  vibra- 
tional modes  having  different  propagational  velocities  are  present, 
hence  leading  to  different  time  delays  for  each  mode  of  vibration. 

For  a particular  point  along  the  z axis,  and  for  a particular 
frequency,  the  displacement  of  particles  can  be  described  by  the  con- 
ventional expression 

x(z,t)  = A exp  j (2-rrft  - kz) 

where 

x ■ displacement 
t “ time 
A = amplitude 
f = frequency 

k = wave  number  “ 2n/wavelength 

In  practice  it  is  generally  the  rate  of  change  of  x with  respect  to 
time  that  is  measured,  or 

x(z,t)  = 2irfA  exp  j (27rft  - kz  + ~) 

If  we  consider  the  effective  change  in  x at  these  two  sensore  (i.e. 
sensor  1 at  z - 0 and  sensor  2 at  z - d) , the  wave  form  of  these  two 
sensors  becomes 

x^(o,t)  = 2irfA  exp  j (2irft  + ■—) 

X2(d,t)  = 2irfA  exp  j (2irft  - kd  + —) . 

The  difference  between  the  above  wave  forms  is  due  to  a phase  difference 
<f»  = -kd. 

The  wave  number  k can  be  expressed  in  terms  of  the  velocity  ot 
propagation  v and  the  frequency  as  k = 2irf/v.  Therefore, 


* Actually,  the  assumption  ot  motion  along  a single  direction  can  be 
relaxed  to  include  motion  having  only  one  independent  variable  such  as 
for  Rayleigh  waves  where  the  motions  in  orthogonal  directions  are  depen- 
dent on  one  another. 
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♦ =*  -2*f(^) 

The  time  delay  between  signatures  received  at  the  sensors  is  the  quantity 
d/v  and  is  therefore  given  by 


u 2 iff 

In  the  actual  physical  situations,  the  energy  is  spread  over  bands 
encompassing  a broad  range  of  frequencies.  When  a broad  region  of  fre- 
quencies is  present  in  the  signatures,  a plot  of  phase  difference  versus 
frequency  can  be  generated.  For  nondispersive  media  the  velocity  is 
constant  so  that  the  phase  shift  increases  linearly  with  frequency  as 
indicated  in  Figure  2a.  For  dispersive  media  the  phase  shift  may  become 
a complex  function  of  frequency  as  indicated  in  Figure  Zb.  The  time 
delay  between  two  signatures  is  directly  related  to  the  phase  difference  4 
as  indicated  in  equation  1.  Therefore,  emphasis  in  the  remainder  of 
this  paper  is  upon  determination  of  <t>  , since  it  is  the  phase  shift 
that  is  directly  obtainable  from  the  conventional  frequency -by- frequency 
comparison  of  two  signatures. 

With  the  advent  of  the  fast  Fourier  transform  (FFT) , the  complete 
spectrum  can  be  computed  in  reasonable  computational  time.  It  is  now 
practical  to  compute  the  phase  difference  for  each  discrete  frequency 
band  resolved  in  the  frequency  domain.  The  conventional  approach  consists 
of  the  following  steps: 

ji.  The  time  domain  signatures  received  at  two  sensors,  x and 
y , shall  be  denoted  by  x£t)  and  y£t)  . The  frequency 
domain  representation  of  x(t)  and  y(t)  is  obtained  by 
application  of  the  discrete  Fourier  transform  to  obtain 


^ n-1 

= At  £ exp  (-j2T7f^mAt) 
m=0 

^ n~l 

% « At  £ y exp  (-j2irf,  mAt) 
k m=0  m k 


where  At  is  the  sampling  period,  and  y^  are 
x(mAt)  and  y(mAt)  , respectively,  and  X^  and  Y^  are 
complex  variables  with  real  and  imaginary  parts.  The  real 
part  of  X^  (or  Y ) denotes  that  part  of  x(t)  [or  y(t;] 
that  can  be  associated  with  the  cosine  terms  in  Fourier 


expansion  while  the  imaginary  part  can  be  associated  with 
the  sine  terms  in  the  Fourier  expansion.  The  "tildes" 
over  It  and  X indicate  that  X and  Y are  "raw" 

K,  K k K. 
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estimates  of  the  frequency  domain  signatures  without  any 
averaging  over  adjacent  frequency  values  or  sequential 
time  estimates* 


b.  The  cross-spectral  density  estimate,  G . , is  obtained 

xy  ic 

from  and  using  the  defining  expression 


G 


xyk=X£  \ 


where  the  asterisk  denotes  complex  conjugate*  The  result- 
ing expression  for  G can  be  separated  into  real  and 
imagery  parts  in  the  ^orm 

Gxyk  = Cxyk  " j Qxyk 

where  C - is  referred  to  as  the  cospectrum  estimate  and 
Q k is  the  quadrature  spectrum  estimate.  The  power 

spectrum,  or  autospectrum,  is  estimated  for  both  x(t) 
and  y(t)  from  the  expression 

G*k  ■ K \ 

and 

G , = Y*  Y.  . 
yk  k k 

c_.  The  raw  cross-spectral  estimates  can  be  smoothed  over  M 
adjacent  estimates  using  the  expressions 

1 M 

Gxk  M . Gx(k  + j) 


1 M 

Gyk  M ^ Gy(k  + j) 

1 M 

Gxyk  M Gy(k  + j) 
3 t 


Gxyk  ^ ^xyk 
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The  value  of  M selected  for  the  smoothing  is  dependent 
upon  the  physical  properties  of  the  medium  through  which 
the  energy  propagates,  the  desired  accuracy  of  the  esti- 
mates, and  the  desired  frequency  resolution.  As  the  value 
of  M is  increased,  the  bandwidth  of  the  resolved  fre- 
quencies is  increased,  and  the  accuracy  of  the  estimates 
are  improved.  However,  the  bandwidth  of  the  resolved 
frequencies  is  restricted  by  the  degree  of  dispersion 
(i.e.  the  dependence  of  velocity  on  frequency).  This  is 
because  improvement  in  the  accuracy  of  estimates  with 
increased  values  of  M is  dependent  upon  the  assumption 
that  the  velocity  of  propagation  of  energy  is  reasonably 
constant  in  the  bandwidth  of  resolution  of  frequencies. 
Therefore,  M is  restricted  to  values  for  which  this 
assumption  is  true. 


d. 


The  phase  difference,  <J>  , , between  time  series  x(t) 

r xyk. 

and  y(t)  is  estimated  from  the  expression 


arc  tan  (Q^/C^)  . 


An  additional  parameter  having  useful  properties  for  the 
application  of  the  computational  technique  to  time-delay 
estimates,*  or  equivalently,  estimates  of  phase  differ- 
ence, is  the  squared  coherence,  . This  parameter  is 
defined  in  terms  of  the  cross  spectrum  G (f)  and  the 

autospectra  G (f)  and  G (f)  as  XjV 

x y 


y m . 

YxyU;  G (f)  G (f) 

x y 

The  coherence  estimate  can  be  expressed  in  terms  of  the 
previously  obtained  estimates  of  power  and  cross  spectra 
as 


'xy 


* xyk* 

A A 

G . G 1 
xk  yk 


a a a n 

(T  , + Q , 
xyk  xyk 

™ A A 

G . G , 
xk  yk 


3.  APPLICATION.  Application  of  the  conventional  approach  des- 
cribed previously  for  computation  of  the  phase  difference  between  two 
signatures  may  produce  inconsistent  results.  For  example,  the  calculated 


* B.  V.  Hammon  and  E.  J.  Hannan,  "Spectral  Estimations  of  Time  Delays 
for  Dispersive  and  Non-Dispersive  Systems,"  Journal  of  Applied 
Statistics,  Vol  23,  1974,  pp  134-142. 
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phase  difference  between  two  signatures  can  be  so  irratic  in  some  frequency 
bands  that  it  is  impossible  to  identify  which  estimates,  if  any,  are 
reliable.  discussed  by  B,  V.  Hammon  and  E.  J.  Hannan,*  the  coherence 
parameter  (Y  ) can  be  directly  related  to  the  accuracy  of  the  cross- 
spectral  estimate  (hence  phase  difference)  at  a particular  frequency. 
Because  the  accuracy  of  the  phase-difference  estimate  improves  rapidly 
as  the  coherence  increases  toward  unity,  the  coherence  parameter  can  be 
used  to  determine  what  frequencies  can  be  used  to  provide  the  best 
estimates  of  phase  difference.  Results  of  applying  the  conventional 
approach  to  phase-difference  estimates,  ignoring  coherence,  are  presented 
below.  Problem  areas  are  identified.  The  coherence  parameter  is  then 
included  in  the  analysis  and  the  resulting  contribution  of  coherence  to 
identifying  reliable  phase  difference  estimates  is  discussed. 

Application  excluding  coherence.  The  first  calculation  of  phase 
difference  considered  is  the  result  of  analysis  of  seismic  signatures  of 
an  M113  AFC  (armored  personnel  carrier)  collected  with  two  geophones 
spaced  6 m apart.  The  phase  plot  (Figure  3a)  suggests  that  there  is  a 
continuous  functional  dependence  between  the  frequency  and  phase  difference 
in  the  approximate  frequency  bands  5-25  Hz  and  50-210  Hz.  However, 
scattering  in  the  phase-difference  estimates  is  clearly  evident  between 
25-50  Hz  and  at  frequencies  greater  than  210  Hz  with  some  scattering  in 
all  bands.  Observation  of  the  power  spectrum  (Figure  3b)  shows  that 
power  is,  on  the  average,  somewhat  larger  in  the  50-210  Hz  band  than  in 
^ 25-50  Hz  and  210-250  Hz  bands  and  considerably  larger  in  the  0-25  Hz 
region.  It  can  be  postulated  that  the  contributions  of  random  noise  in 
the  instrumentation  and  background  seismic  noise  combine  to  mask  the 
contributions  of  the  signature  of  the  M113  APC  in  low-energy  spectral 
regions.  The  resulting  estimate  of  phase  difference  in  the  "lower 
energy"  bands  appear  to  be  random. 

Often  it  is  desirable  for  the  investigator  to  determine  phase 
differences  from  two  or  more  energy  sources  simultaneously.  Such  is 
illustrated  in  Figure  4;  two  seismic  sources  are  present,  and  M151  jeep 
and  an  M35  truck.  An  ambiguity  then  arises  as  to  what  the  "correct" 
time  delay  is.  For  instance,  the  effective  distances  between  the  geo- 
phones for  each  of  the  two  sources  are  different  so  that  the  resulting 
time  delays  in  the  reception  of  the  respective  energy  contributions  of 
the  two  sources  are  different.  This  ambiguity  results  in  scattering  of 
the  estimates  of  phase  difference  in  the  frequency  regions  where  the 
energy  from  the  respective  sources  overlap.  Estimates  of  phase  difference 
for  the  physical  situation  illustrated  in  Figure  4 are  presented  in 
Figure  5a,  The  estimates  are  noticeably  scattered.  Inspection  of  the 
power  spectrum  (Figure  5b)  shows  that  the  energy  is  again  primarily 
concentrated  below  25  Hz.  However,  several  strong  lines  appear  in  the 
spectrum  (for  example  at  approximately  80,  90,  and  170  Hz)  suggesting 
that,  at  least  in  these  narrow  frequency  bands,  one  source  could  be 
dominating  over  another  (i.e.,  it  would  be  unlikely  that  the  two  sources 


*Ibld. , p 5. 
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would  generate  peaks  of  energy  in  che  same  narrow  frequency  bands).  It 
should  be  possible  to  identify  reliable  phase-difference  estimates 
(hence  time  delays)  for  these  narrow  frequency  bands.  However,  reliable 
estimates  may  exist  in  bands  not  readily  identifiable  in  the  power 
spectrum. 

Application  including  coherence.  In  view  of  the  difficulties 
encountered  in  estimating  phase  differences  between  signatures  generated 
by  multiple  sources  or  containing  random  noise,  some  criterion  is  desirable 
for  selecting  potentially  accurate  phase-difference  estimates.  As 
discussed  by  Hammon  and  Hannan,*  the  coherence  parameter  provides  an 
estimate  of  the  accuracy  of  the  phase-dif ference  estimates.  The  coherence 
spectra  for  the  two  cases  discussed  previously  are  presented  below  and 
the  phase  differences  are  discussed  in  view  of  the  associated  coherence 
spectra. 

The  coherence  spectrum  for  the  single  energy  source  (M113  APC)  is 
presented  in  Figure  6a.  The  average  value  of  the  coherence  is  relatively 
low  in  the  frequency  region  between  25  and  50  Hz  and  for  frequencies 
greater  than  210  Hz,  There  are  large  variations  in  the  coherence  in  the 
50-210  Hz  region.  If  the  phase  difference  is  plotted  only  for  values  of 
coherence  greater  than  0.5,  only  the  phase  difference  appearing  in 
Figure  6b  appears.  Comparison  of  Figure  6b  with  the  previous  plot  of 
phase  difference  (Figure  3a)  reveals  that  many  estimates  have  been 
eliminated,  particularly  those  having  no  correlation  to  adjacent  estimates. 
The  coherence  criterion  suppresses  the  random  components  of  the  estimates 
of  phase  difference. 

The  coherence  spectrum  for  the  multiple  energy  sources  is  pre- 
sented in  Figure  7a..  The  coherence  is  generally  lower  than  that  of  the 
single-energy  source.  The  phase  difference  is  plotted  for  values  of 
coherence  greater  than  0.5  in  Figure  7b.  Comparison  of  Figure  7b  with 
Figure  5a  reveals  that  the  number  of  phase  differences  that  satisfy  the 
coherence  criterion  is  significantly  less  than  the  number  of  available 
estimates.  "Clusters"  of  three  or  four  estimates  satisfying  the  coherence 
criterion  occur  at  approximately  80,  90,  and  180  Hz,  corresponding  to 
lines  in  the  power  spectrum*  However,  other  phase  differences  also 
satisfy  the  criterion,  indicating  that  inspection  of  the  power  spectrum 
alone  does  not  necessarily  allow  one  to  identify  all  of  the  reliable 
estimates  of  phase  differences. 

Summary , Two  sets  of  data  have  been  analyzed  to  obtain  the  phase 
difference  between  signatures  for  subsequent  determination  of  time 
delays.  Random  instrument  noise  and  the  interaction  between  the  signa- 
tures of  multiple  energy  sources  have  been  hypothesized  as  potential 
causes  for  scattering  of  the  estimates  of  phase  differences.  Those 
estimates  of  phase  difference  useful  for  estimating  time  delays  have 
been  identified  by  applying  a criterion  on  coherence  to  the  estimates  of 

phase  difference.  The  coherence  estimate  is  easily  implemented  in  the 
conventional  approach  to  obtaining  estimates  of  phase  difference,  since 
the  supporting  parameters  (i.e,  3L  , Y , G , , etc.)  must  be  calculated 

in  order  to  obtain  the  phase  difference. 

*Ibid. , p 5. 
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Figure  1,  Geometric  relation  between  sensors  and  incoming 
ylane  waves 
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Figure  2,  Phase  differences  for  nondispersive  and  dispersive 
media 
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b.  Power  spectrum 


Figure  3 


Phase  difference  and  power  spectrum  for  single 
source  (Ml 13  APC) 
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Figure  4*  Physical  description  of  multiple  sources 
(M151  jeep  and  M35  truck)  study 
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Figure  S.  Phase  difference  and  power  spectrum  for  multiple 
sources  (Ml 51  jeep  and  M35  truck) 
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a.  Coherence  spectrum 
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Figure  6.  Coherence  spectrum  and  phase  difference  for 
single  source  (M113  APC) 
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ABSTRACT.  A new  relaxation  method  is  formulated  by  substituting 
artificial  time  dependence  for  true  time  dependence  in  the  Euler  equa- 
tions of  fluid  mechanics.  The  previously  hyperbolic  equations  are 
thereby  converted  into  parabolic  form,  and  the  convergence  question  is 
studied  analytically  within  the  vehicle  of  linearized  small  disturbance 
theory.  The  fully  nonlinear  equations  are  solved  numerically  using  this 
new  relaxation  technique  for  the  case  of  a NACA  0012  airfoil  at  2 angle 
of  attack.  The  method  is  seen  to  work  quite  well  and  warrants  further 
extension  into  the  transonic  range. 

INTRODUCTION.  The  problem  of  determining  the  flow  field  past: 
an  arbitrary,  two-dimensional  body  immersed  in  an  unbounded  compressible 
gas  has  been  the  subject  of  countless  investigations  over  the  last  half- 
century,  Most  of  these  analytical  methods  gave  reasonable  results  until 
the  free  stream  Mach  number  increased  to  the  point  that  a finite  super- 
sonic zone  usually  terminated  by  a shock  wave  appeared  on  the  body.  This 
transonic  flow7  regime  has  only  during  the  last  few  years  yielded  to  the 
attempts  of  many  researchers  to  develop  solution  techniques  general 
enough  to  be  utilised  as  design  tools,  These  successful  techniques  are 
primarily  numerical  relaxation  methods  applied  to  the  entire  flow  field 
equations , 

In  the  current  work  described  here,  the  fully  nonlinear  equation 
is  solved  utilizing  transformation  technique  for  handling  boundary  con- 
ditions, However,  the  relaxation  process  itself  is  quite  novel  and  the 
physical  motivation  described  in  deriving  the  relaxation  method  pre- 
sented i ; e r e ma  k e s it  quite  ‘possible  to  d e v i s e a r e .1  axa t i o n p r o c e s. s wh i ch 
converges  much  faster  than  any  technique  currently  being  used.  In  the 
sections  to  follow,  the  derivation  of  the  relaxation  method  will  be 
presented  followed  by  a discussion  of  the  question  of  convergence.  The 
application  of  this  technique  to  subsonic  flow  is  presented  in  Sections 
4 and  5,  and  results  are  discussed  in  Section  6, 
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2.  MOTIVATION  AND  DERIVATION  OF  METHOD.  The  selection  of  any 
particular  iterative  method  of  successive  approximation  for  finding 
the  solution  of  a set  of  nonlinear,  algebraic  equations  is  determined 
firstly,  by  the  conditions  for  convergence  of  the  method  and  secondly, 
by  the  speed  with  which  the  method  converges.  The  concept  of  utilizing 
the  fully  unsteady  equations  of  motion  to  obtain  steady-state  flow 
fields  is  particularly  intriguing  from  the  point  of  view  of  a relaxa- 
tion scheme  selection  since  one  can  argue  from  physical  experiments 
that  initially  unsteady  transonic  flow  fields  with  steady  boundary 
conditions  do  indeed  settle  down  to  steady-state  conditions.  Hence 
a relaxation  process  which  faithfully  describes  this  true  transient 
behavior  should  always  converge  to  the  desired  steady-state ' solutions . 

The  equations  to  be  solved  are  of  hyperbolic  type  and  hence  the  flow 
pattern  at  any  finite  instant  of  time  is  composed  of  a complex  pattern 
of  rarefaction  and  compression  waves.  The  numerical  description  of 
this  complex  process  together  x^ith  the  boundary  conditions,  even  though 
guaranteed  to  converge  if  the  difference  scheme  is  stable,  requires 
an  enormous  amount  of  real  computer  time,  i.e.,  the  speed  of  convergence 
is  slow* 

To  obtain  a relaxation  process  x^hich  converges  more  rapidly,  perhaps 
some  other  artificial  time  behavior  should  be  utilized  rather  than  the 
true  time  behavior  of  the  hyperbolic  Euler  equations  for  unsteady  flow* 
The  motivation  for  this  concept  is  that  hyperbolic  systems  approach 
their  asymptotic  steady  states  through  a series  of  expansions  and  com- 
pressions, i.e*,  through  a wave  system.  Now  if  one  substitutes  for 
this  true  time  behavior,  an  artificial  time  behavior  which  yields  a 
parabolic  set  of  equations,  then  the  resulting  solution  should  settle 
down  to  its  asymptotic  steady  state  through  a process  of  diffusion.  The 
attempt  here  is  to  draw  an  analogy  betx^een  heat  diffusion  processes 
which  are  governed  by  parabolic  type  equations  and  the  unsteady  fluid 
flow  equations  which  have  been  manipulated,  albeit  artificially,  to  be 
parabolic  equations.  Carrying  the  analogy  further,  since  heat  diffusion 
processes  have  been  observed  to  reach  their  asymptotic  steady  state 
rather  quickly,  exponentially  fast  in  some  cases,  it  then  night  be  ex- 
pected that  the  fluid  flow  should  also  approach  its  steady  state  more 
quickly  through  an  artificial  time-dependent  process  which  is  of  para- 
bolic character.  That  is  to  say,  a hyperbolic  or  wave-type  set  of 
equations  has  been  traded  for  a parabolic  or  diffusion-type  set  of 
equations.  . It  must  be  remembered,  however,  that  the  quantity  called 
time  in  the  parabolic  equations  can  no  longer  be  interpreted  as  a 
physically  meaningful  quantity'  but  takes  on  the  interpretation  of  an 
artificial  parameter  which  gauges  the  nearness  to  an  asymptotic  steady- 
state  solution.  The  desired  solution  is  obtained  when  dependent  flow 
variables  no  longer  depend  on  this  artificial  parameter.  At  this  point, 
it  is  reasonable  to  ask  if  these  parabolic  equations  converge  to  some 
steady-state  solution  and  if  the  solution  so  obtained  is  the  correct 
one*  This  question  will  be  discussed  in  Section  3.  To  specifically 
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define  this  new  relaxation  concept,  let  p = density,  q - velocity 
vector,  p = pressure,  Q - heat  per  unit  mass  and  time,  i = enthalpy, 
R = gas  constant.  Then  the  governing  equations  for  unsteady  flow  of 
an  inviscid  perfect  gas  past  an  obstacle  are 

(1)  “y-  + V.^q]  = 0 


(2)  — + (q.v)  q = - Vp 


<3>  al(i+iq2)+(q’v)(i+^2)  =^lf+Q 


(4>  p = p R T . 

The  boundary  conditions  for  an  obstacle  immersed  in  an  unbounded  fluid 
are 

(5)  qn  = 0 , q = 

where  q is  the  velocity  normal  to  body  surface  and  q^  is  the  velocity 
vector  at  infinity.  In  addition,  for  flows  with  circulation,  the  Kutta 
condition, 


where  q^  is  the  velocity  at  the  trailing  edge,  must  be  satisfied.  For 
free-stream  Mach  number  ranging  up  to  transonic  values,  the  flow  field  can 
be  assumed  to  be  both  isentropic  and  irrotational  at  least  to  second  order 
since  shock  waves,  if  present,  are  weak  and  therefore  produce  entropy 
jumps  of  third  order.  Eq.  (3)  is  consequently  replaced  by 

P 

(6)  — ==-  = CONSTANT 

P7 

Now  defining  i0  = stagnation  enthalpy,  = free-stream  speed  of  sound, 
h = i + q^/2  - i q , the  conservation  of  momentum  and  mass  becomes,  res- 
pectively, 
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(7) 


+ Vh  s 0 

at 


(8) 


ig.  _ lh_  _(7  -i)^.q)h  = a2  V*q 

at  at  'v  ' 00 

_ h-1) 
2 


The  expression  on  the  left  side  of  Eq.  (8)  lumps  all  the  time  de- 
rivatives  into  one  term,  which  can  be  written 


(9)  -2(q-v)h-  •—-(>  - l)(v  q)h  = -J(h) 


Now  if  distance,  velocity,  q,  time,  t,  and  h are  interpreted  as  having 
been  scaled  with  respect  to  some  characteristic  length,  L,  f ree-s tream 
velocity,  qkX,  characteristic  time,  L/q^ , and  q respectively,  and 
defining  M = f ree^stream  Mach  number, 


then  Eq*  (8)  becomes 


Vv'q  1]  v-q  + 


(10)  M2 2 


00 


+ "V  F(h)  = 0 


For  irrotational  flow,  q = V$,  and  then  from  Eq.  (7) 

h — *“3  <±>  / 9 1 . 
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Introducing  the  potential  function  in  Eq.  (10)  gives 


(10a)  J—  V2<£  - (v$  * v)  ?*** 

1 2 

ii^ 


The  method  of  parabolic  substitution  is  to  substitute  a simple  func- 
tion for  F ( -$t)/MK,2j  termed  S ($t) , which  yields  a parabolic  equation 
and  which,  since  steady-state  solutions  are  being  sought,  satisfies  the 
condition  S (0)  = 0. 

Within  these  two  restrictions  the  choice  for  S ($t)  is  arbitrary.  Of 
course  having  made  a choice  for  S,  the  question  of  convergence  as  time 
increases  of  the  resulting  artificially  unsteady  flow  to  steady-state 
conditions  arises. 

' 3.  THE  CONVERGENCE  QUESTION 


7 — 


— - y v2<*>  + 


+ F<-*0  ■- 0 


A simple  choice  for  S which  satisfies  the  above  two  requirements  is 

(11)  $(<Df)  = - <Df 


To  study  the  convergence  of  the  resulting  unsteady  solution  to  the  re- 
quired s teady-sta te  values  as  time  increases,  a small  disturbance 
analysis  is  made  for  symmetric  flow  past  a thin  two-dimensional  ob- 
stacle. This  problem  is  stated  as 


(12) 


(13) 


M, 


XX  + *Vy  - 7 


= Y(x)  f(t)  (|xj  < 2 , t >0) 


y ” o 

*x  > ^y 


as 


+ y 


00 


('*)  = fv)  = (’’*)  = 0 

' A =o  \ /t=  o ' /t  - o 


where  41  - perturbation  potential,  Y(x)  = surface  slope  of  obstacle  and 


(0  for  t < 0, 
= jl  for  t > 0. 
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The  above  problem  is  linear  and  exact  closed  form  expressions 
can  be  obtained  for  the  potential  function.  When  asymptotic  expansions 
are  made  for  large  t,  the  solution  yields 


x,y 


V steady  + °<f> 


and  therefore  as  t— *»,  the  approximate  solution  does  converge  to  the 
correct  steady-state  solution. 

To  investigate  another  choice  for  S (4^) , let 

(U)  S(4>,)  = 


which  also  satisfies  the  two  requirements  on  S.  The  problem  to  be 
solved  is  the  same  as  before  except  Eq,  (12)  is  replaced  by 


T' it"  V 


The  solution  of  this  equation  subject  to  conditions  (13)  can  also  be 
found  analytically  and  expanded  for  large  t to  give 


4>  = (<t>  ) , + 

x x steady 


0(-~2) 


(4  ) 

y steady 


t > y/2 


and  therefore  as  t — * the  approximate  solution  does  converge  to  the 

correct  steady-state  solution*  Since  it  is  of  interest  to  apply  this 
new  relaxation  procedure  to  transonic  flow  eventually,  the  convergence 
question  can  also  be  studied  within  the  vehicle  of  linearized  transonic 
small  disturbance  theory  together  with  the  substitution  of  Eq.  (11)* 
Hence  the  equation  is  changed  from  Eq.  (12)  to 


(’  " M“)  vxx 


yy  K -1 

where  K is  a positive  constant  representing  acceleration  at  some  point 
on  the  airfoil.  Again,  asymptotic  expansions  of  the  analytic  solution 
are  easily  found  to  be 

$ - ( <j>  ) + Q(^“  e at) 

x x steady  t 


<f>  = (6  ) - 4-  0( — o 

Yy  Kyy'  steady 


-at. 
e ) 
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where  a Is  a constant  and  hence  the  approximate  solution  is  seen  to 
converge  to  the  correct  steady-state  solution.  To  examine  this  question 
of  convergence  for  the  actual  nonlinear  equation  of  interest  requires 
a complete  numerical  analysis  using  high-speed  computers. 

4.  PREPARATION  FOR  NUMERICAL  ANALYSIS 


The  problem  to  be  discussed  in  the  remaining  part  of  this  paper 
is  the  numerical  solution  of  Eq.  (10a)  with  the  substitution  (11)  for 
a high-speed  flow  past  an  arbitrary  two-dimensional  body.  To  enable 
an  accurate  numerical  description  of  the  boundary  conditions  a con- 
formal transformation  is  made  from  the  physical  flow  plane  Into  the 
interior  of  a unit  circle,  hereafter  called  the  computation  plane.  In 
doing  thusly,  the  arbitrary  body  surface  is  transformed  onto  the  unit 
circle  In  the  computation  plane  and  infinity  in  the  physical  plane  is 
transformed  into  the  center  of  the  circle  in  the  computation  plane. 

A cylindrical  polar  coordinate  system  r,  0 is  utilized  in  the  computa- 
tional plane  so  that  r = 1 corresponds  to  the  body  surface  and  r = 0 
corresponds  to  infinity.  As  an  extra  advantage,  this  mapping  procedure 
distributes  mesh  points  more  densely  in  those  regions  in  the  computa- 
tion plane  where  flow  acceleration  is  the  greatest,  thus  allowing  for 
better  flow  resolution  in  those  regions  where  variables  are  changing 
the  most.  For  some  body  contours,  such  as  Joukowsky  or  Karmdn-Tref f tz 
airfoils,  this  transformation  can  be  computed  analytically,  however, 
for  arbitrary  bodies  the  method  of  D.  Catherall,  D.  N.  Foster  and 
C.  C.  L.  Sells  [1]  for  calculating  the  transformation  has  been  faithfully 
followed  in  this  report. 


In  the  general  case  of  flows  with  circulation,  the  potential  func- 
tion has  a dipole  singularity  at  r “ 0.  Hence  by  comparison  with  in- 
compressible flow,  it  is  seen  that  if  $ is  expanded  about  r = 0 in  the 
computation  plane,  the  form  has  to  be 


(16) 


COS(a  + a,  + 0) 

<J>  ~ + L,(0)  + 0(0 


where  a = angle  of  attack  with  respect  to  x-axis  (positive  counter 
clockwise),  = flow  direction  with  respect  to  x-axis  for  zero  lift 
in  Incompressible  flow  (positive  clockwise) , L. (0)  = unknown  function 
of  0. 

Hence,  to  determine  Li(0) , the  transformed  equation  for  $ is  expanded 
about  r = 0 and  the  highest  two  orders  are  retained.  The  result  is  that 


tan  (a  + a(  + 
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where  E is  a constant  to  be  determined  from  the  Kutta  condition  at  the 
trailing  edge  and 

tan  V,  ~ tan  (a  + a,  + 0)] 

is  determined  to  have  the  same  quadrant  as  oi  + + 0. 

A disturbance  potential,  x>  is  now  introduced  by  the  definition 

X/  I cos  (a  + a.+  0) 

(17)  X = + E + 0 + a,) 

L -1 

+ r cos  (a  + a,  + 0) 

In  terms  of  x»  the  boundary  conditions  at  r = 1 and  r * 0 are 

<18>  *rO»«,*)-0 


U9)  x(0,*,t)  = L,(8)  - E(»  + a + a,)  . 


By  initially  selecting 


E = 2 sin  (a  + a^) 


and 

then 

(20) 


$(r,0,O)  = 4>. 


incompr 


X(r ,0,0)  = 0. 


As  previously  pointed  out,  E is  determined  by  the  Kutta  condition  at  the 
airfoil  trailing  edge  located  at  r = 1,  0 = 0.  This  condition  requires 

[Xe]r-1,0-O  "E  + 2 sln  <“  + “l> 
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or 

(21)  Eft)  = -XgU.O.t)  + 2 sin  (a  + c^) 

and  hence  E can  be  updated  at  each  time  level  as  the  solution  proceeds. 
The  initial  and  boundary  conditions  have  now  been  put  in  a form  easily 
expressed  by  numerical  techniques.  However,  since  a potential  func- 
tion analysis  has  been  used,  there  will,  in  general,  be  a branch  cut 
in  the  computation  plane  defined  by  0 “ 0,  0<_  r <_  1.  Hence,  when 
computing  G-derivatives  along  G = 0,  central  differences  cannot  be  used 
but  must  be  supplanted  with  second  order  accurate  forward  or  backward 
differences  along  the  rays  0 = 0,  9 = 2tf,  respectively.  Since  pres- 
sure must  be  continuous  across  this  branch  cut  the  velocity  components 
on  0 = 0 must  be  the  same  as  they  are  on  0 =-  2tt  for  a given  r.  Hence 
along  the  branch  cut  one  can  write  2N  + 2 equations , where  N t is  the 
number  of  mesh  points  in  the  radial  direction  excluding  r - 0,  for 
2 N +2  values  of  x*  This  system  of  linear  algebraic  equations  can 
be  straightforwardly  solved  by  method  of  elimination  to  establish 
the  values  of  x along  the  ray  9 = 0 and  the  ray  0 = 2 tt  in  terms  of 
neighboring  x values.  The  two  rays,  0=0  and  G - 2tt  now  are  used  as 
two  additional  boundaries  to  the  numerical  scheme. 


The  appropriate  equation  which  now  must  be  solved  at  the  interior  mesh 
points,  0 < r < 1 and  0 < 0 < 2tt,  is  the  transformed  equation  (10a) 
with  the  substitution  of  (11)  for  the  artificial  time  dependence  and 
the  introduction  of  x through  Equation  (17).  This  complicated  equation 
takes  the  form 


r xrr+  xee+  rxr 


+ M^(Axrr+  + Cxeg+  ^ - Xt 

where  @ = transformation  modulus  and  is  a function  of  r}  0, 
A = + (a2  + a3E)E  + (a^  + a5'/r)xr  + 

+ (a6  + ayE  + a8x0-)x0, 

B = (bl  + b2Xr)  (b3  + E + XG) ’ 

C = + c2(E  + x0)  + c3(E  + xG)2  + (c4  + csXr)Xr, 

D = (d  + d E + d E2)x  + (d,  + d E + d^E 2)\r  + 
i 2 3 r <+  b 6 9 

+ (d7  + ci8E)>;r2  + (d,  + d10E)xe2  + 

+ <dll  + + *1}\\  * dlixo\  + 

+d15Xr3  + dlf)(xe3  + E3)  + d17  + d18E  + d15E2, 
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and  a^,  b^,  and  d^  are  functions  of  rf  0 as  determined  by  the  com- 
puted conformal  transformation, 

5.  NUMERICAL  METHOD. 

Eq.  (22)  is  now  differenced  in  the  classic  manner  for  parabolic 
equations.  Second  order  accurate  centered  differences  are  used  for 
spatial  derivatives  and  the  first  order  accurate  Euler  difference  is 
used  for  the  time  derivative.  Truncation  error  in  the  time  differencing 
formula  is  of  no  concern  since  the  solution  is  found  when  time  derivatives 
are  zero.  The  boundary  conditions  given  by  Eqs.  (18)  and  (19)  are  im- 
posed by  using  second  order  accurate  backward  differencing  for  x at 
r = 1 and  second  order  accurate  forward  differencing  for  Xq  at  r = U 
0-0.  The  resulting  system  of  equations  advances  the  value  for  x from 
t to  t + At.  Eq.  (20) establishes  the  initial  values  from  which  the 
relaxation  procedure  begins.  Once  the  values  are  advanced  from  t to 
t + At  throughout  the  entire  interior  mesh,  and  on  the  boundaries,  E is 
evaluated  from  Eq.  (21)  and  the  iteration  is  ready  to  advance  another 
time  step.  The  time  step  is  selected  at  each  time  level  based  on  a 
heuristic  approach  to  the  stability  question.  By  analogy  with  the  two- 
dimensional  diffusion  equations  in  Cartesian  coordinates  studied  by 
R.  D.  Rlchtmyer  [2],  the  stability  criterion  for  the  current  problem 
was  selected  as 


(r2<&2  + M 2 A) 

00 


-—■  + fe2  + M^C) 
Ar 


However,  by  running  several  numerical  experiments  it  was  discovered  that 
stability  could  still  be  maintained  if  the  1/2  were  increased  to  0*61, 
Spatial  mesh  widths  were  defined  by  selecting  60  equally  spaced  rays 
with  10  equally  spaced  points  on  each  ray,  not  including  the  point  r = 0. 

6.  RESULTS 


The  above  described  numerical  method  has  been  programmed  and  run 
on  a UNIVAC  1106  for  the  cases  of  symmetric  flow  past  a Joukowskv  profile, 
lifting  flow  past  an  8,57%  cambered  Karman-Tref f tz  airfoil  of  10°  trailing 
edge  angle,  and  a MCA  0012  airfoil  at  2°  angle  of  attack.  The  latter 
solution  is  shown  in  Fig.  1,  The  iteration  converged  in  all  three  cases 
for  purely  subsonic  flow  and  sufficient  accuracy  was  obtained  after 
about  20  minutes  of  computer  run  time.  For  fully  nonlinear,  compres- 
sible flow  past  a Joukowsky  or  Kannan-Tref f tz  airfoil,  other  results 
for  comparison  with  the  current  method  could  not  be  found  in  the 
literature.  In  Fig.  1,  the  results  are  compared  with  the  numerical 
solution  of  Sells  given  by  R,  C . Lock  [3].  The  method  can  therefore 
be  seen  to  work  quite  well  and  warrants  further  extension  into  the 
transonic  range. 
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7.  TRANSONIC  FLOW  CONSIDERATIONS 


To  accomplish  this  task,  the  substitution  form  given  by  Eq.  (14), 
modified  to  account  for  the  fully  nonlinear  potential  equation,  should 
be  used  to  greatly  reduce  machine  running  time.  The  modification  comes 
about  because  the  coefficient  of  I’yy  in  Eq.  (10a)  is  now  a function  of 
both  dependent  and  independent  variables  whereas  in  the  small  disturbance 
theory,  the  coefficient  is  unity. 

Thus  the  modification  is  derived  by  adding  f,.$yt  and  (^/^^tt 
to  f$yy  where  f,  f-^,  f2  are  functions  of  space  ana  velocity,  such 
that  a parabolic  equation  results.  Thus, 


or  then 


and,  since  f is  known  from  (10a),  one  has  an  arbitrary  choice  to  make 
for  f^  or  f^.  Suppose  one  chooses  f£  * f,  then  the  substitution  term 
becomes  * 

S<V  - f(i  »tt  * v). 

Another  possibility  would  be  to  choose  ■ 1/f,  then  the  substitution 
term  becomes 

S(V  -l(i  + 

The  numerical  difference  scheme  for  spatial  differences  must  change 
when  the  flow  field  becomes  supersonic  locally.  Hence,  an  additional 
refinement  in  the  method  must  be  made  so  that  the  Mach  number  at  each 
mesh  point  is  determined  and  tested  to  see  if  the  flow  is  supersonic 
or  not.  For  supersonic  points  the  space  differences  in  the  approximate 
streaming  direction  must  be  backward  differences,  not  central  dif- 
ferences as  they  are  in  the  subsonic  portion.  For  differences  in  the 
y-direction,  the  change  over  in  difference  scheme  is  not  necessary. 

The  difference  scheme  can  also  be  changed  to  gain  some  artificial 
viscosity  which,  in  addition  to  stability,  furnishes  an  entropy  in- 
equality on  the  solution  obtained.  Shock  waves  should  occur  naturally 
as  the  solution  progresses. 
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THE  BACKWARD  BEAM  EQUATION  AND  THE 
NUMERICAL  COMPUTATION  OF  DISSIPATIVE 
EQUATIONS  BACKWARDS  IN  TIME  t * 

Alfred  Carasso* 

1,  Introduction 

The  purpose  of  this  paper  is  to  survey  the  main  results  of  [2-6], 
dealing  with  the  development  of  a new  algorithm  for  the  approximate  solu- 
tion of  backwards  parabolic  equations.  The  exposition  is  mostly  without 
proofs,  and  the  reader  is  referred  to  the  original  papers  for  a more  de- 
tailed discussion.  Since  this  work  was  begun,  a substantial  amount  of 
computational  experience  has  been  accumulated.  The  method  appears  to 
be  a powerful  tool.  Some  computational  experiments,  undertaken  by 
B,  L.  Buzbee  at  the  Los  Alamos  Scientific  Laboratories,  have  not  been 
published.  They  are  mentioned,  in  Section  3,  More  recently,  [6],  the 
method  was  extended  to  an  interesting  example  of  a nonlinear  equation, 
Burgers'  equation,  and  extensive  numerical  experiments  were  carried  out 
on  problems  for  which  exact  solutions  are  known.  In  Section  5,  we  de- 
scribe some  of  the  results  obtained  in  [6].  Further  applications  to  non- 
linear problems  art  currently  under  way, 
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and  the  United  States  Army  under  Contract  No,  EA-31-124-ARQ-D-462, 
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May  1974.  ' 45g 


2,  Some  general  remarks  on  backwards  dissipative  equations 

We  shall  focus  attention  on  a special  but  quite  interesting 
class  of  ill-posed  problems,  namely  the  problems  which 

arise  when  the  time  direction  is  reversed  in  a dissipative  evolution 
equation.  Such  equations  distinguish  a time  direction  as  they  describe 
irreversible  phenomena.  While  the  problem  of  determining  the  future  from 
the  present  is  well  understood  analytically,  and  a considerable  literature 
exists  which  deals  with  the  effective  numerical  computation  of  such 
forward  problems,  the  same  is  not  true  of  the  backwards  problem.  Attempt- 
ing to  reconstruct  the  past  from  the  present  leads  to  tremendous  difficulties, 
as  the  solutions  depend  discontinuously  on  the  data,  and  this  in  an 
essential  way.  That  is,  continuous  dependence  cannot  usually  be  re- 
stored by  reconsidering  the  question  in  some  other  metrizable  linear  space. 
What  seems  necessary  to  restore  continuity  is  the  imposition  of  additional 
constraints  on  the  class  of  admissible  solutions,  such  as  requiring  the 
solutions  to  be  positive  or  to  satisfy  an  a priori  bound.  On  the  other  hand, 
while  the  constrained  problem  is  well  posed  in  the  analytical  sense,  there 
remains  the  question  of  devising  effective  numerical  methods  in  which  the 
constraints  can  be  incorporated,  if  one  wishes  to  approximate  the  solutions. 
Thus,  the  standard  stable  marching  schemes  (such  as  the  Crank-Nicolson 
or  other  Pade  approximants)  which  have  been  widely  successful  in  the 
numerical  computation  of  forward  dissipative  problems,  are  necessarily 
unstable  when  the  time  direction  is  reversed.  This  is  a general  theorem 
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to  be  found  in  [17,  p.  59].  What  is  missing  in  these  classical  schemes 
is  a means  of  incorporating  the  constraints.  In  recent  years,  it  has  be- 
come apparent  that  the  essential  difficulties  in  the  above  class  of  ill- 
posed  problems  are  algorithmic  in  character.  By  and  large,  the  numerical 
analysis  of  these  problems  is  not  as  well  developed  as  are  the  analytical 
questions  such  as  backwards  uniqueness  and  stability  under  a prescribed 
bound.  Nevertheless,  a great  deal  of  effort  and  ingenuity  has  been  applied 
towards  the  effective  numerical  computation  of  such  problems;  we  refer  to 
[16]  for  an  extensive  bibliography,  and  to  the  paper,  by  Miller  in  the  present 
volume  for  a viewpoint  somewhat  different  from  ours. 

It  should  be  pointed  out  that  not  all  methods  which  have  been  pro- 
posed are  equally  effective  or  applicable.  Thus,  methods 
which  require  exact  knowledge  of  the  data  are  of  limited  usefulness  in 
applications.  Even  when  an  analytical  expression  is  available  for  the 
data,  round-off  error  in  digitizing  that  data  plays  a considerable  role  and 
its  effect  cannot  be  ignored.  The  same  is  true  for  schemes  which  require 
the  data  and  corresponding  solution  to  have  a Fourier  transform  with  com- 
pact support.  As  the  spatial  mesh  is  refined,  extraneous  high  frequencies 
are  injected  into  the  solution  by  the  rounding  process.  These  high  fre- 
quencies may  be  amplified  without  bound,  in  arbitrarily  small  time  intervals, 
as  time  evolves  backwards.  Furthermore,  if  a smoothing  process  is  used 
at  each  time  step,  it  may  not  be  possible  to  decide  which  parts  of  the 
spectrum  should  be  filtered  out.  Thus,  in  nonlinear  problems,  or  even  in 
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linear  problems  with  variable  coefficients,  there  may  be  genuine  inter- 
actions between  different  frequency  bands.  Several  methods  which  have 
been  proposed  (for  linear  problems)  begin  by  recasting  the  problem  in 
the  form  of  an  integral  equation,  i.  e. 

(2.1)  S(T)u  = v 

where  v is  the  given  terminal  data  at  time  T,  u is  the  desired  initial 

data,  and  S(T)  is  the  solution  operator  at  time  T in  the  forward  analytic 

problem.  Since  [S(T)]  ^ is  unbounded,  various  regularization  techniques 

are  then  employed  to  approximate  u . The  difficulty  here  is  that  except 

in  very  simple  problems,  one  generally  does  not  know  S(T)  explicitly. 

Thus,  if  the  equation  has  variable  coefficients  depending  on  time,  one 

rarely  has  formulae  for  the  fundamental  solution.  It  is  a happy  fact  that 

many  stabilized  backwards  dissipative  problems  enjoy  the  property  of 

Holder -continuity  with  respect  to  the  data  on  compact  subintervals  of 

(0,  T]  . Usually,  the  exponent  g.(t)  is  a function  of  t which  tends  to 

zero  as  t i 0 . As  a simple  example,  in  linear  problems  with  a self 

adjoint  operator  one  can  give  a sharp  estimate  for  the  error  at  time  t , 

corresponding  to  an  error  6 at  time  T,  given  an  a priori  bound,  M,  for 

2 

the  initial  data.  One  has  in  the  L norm, 

T-t  t_ 

(2.2)  ||  e(t)  ||  < 2 M T 6T  , , 0 < t < T . 

Here  p(t)  = and  there  is  "destruction  of  information"  as  t i 0 . This 
loss  of  information  is  more  severe  than  that  which  takes  place  in  the  usual  well 


posed  linear  problems  of  mathematical  physics.  On  the  other  hand,  the 
situation  is  far  worse  in  other  classes  of  stabilized  ill -posed  problems. 


Thus,  in  [9],  F.  John  gives  an  example  where  a much  weaker  type  of 

continuity,  "logarithmic  continuity"  can  be  shown  to  actually  hold.  In 

that  example,  the  data  must  be  known  to  an  accuracy  of  10 

-3 

in  order  to  produce  an  accuracy  of  10  in  the  corresponding  solutions  ! 
An  important  requirement  for  a numerical  method  is  that  it  should  preserve 
the  Holder  dependence  inherent  in  the  analytic  problem.  This  requirement 
is  stronger  than  stability.  The  latter  simply  requires  that  round-off  or 
other  errors  in  the  data  not  be  amplified  without  bound  in  finite  time; 
however,  stability  alone  may  not  prevent  the  scheme  from  amplifying 
errors  in  the  data  far  beyond  the  theoretical  limit  set  by  the  estimate 
(2.2),  resulting  in  an  unwarranted  loss  of  precious  information.  Thus  in 
[8],  an  interesting  example  is  given  of  a stable  marching  scheme  for  the 
backwards  heat  equation,  in  which  logarithmic  continuity  with  respect  to 
the  data  actually  holds.  Similarly,  [18],.  Tichonov's  method  for  the  back- 


ward heat  equation  is  only  logarithmically  continuous  with  respect  to  the 
data. 


Even  at  the  analytical  level,  there  remain  genuine  difficulties  in 
obtaining  a priori  stability  estimates  in  backwards  dissipative  problems. 
Such  estimates  are  of  fundamental  importance  as  they  measure  the  rate  at 
which  information  is  destroyed  as  time  evolves  backwards.  To  illustrate 
this  point,  consider  the  situation  for  the  Navier-Stokes  equations  back- 
wards in  time.  In  [12]  the  authors  consider  the  class  of  smooth  solutions 
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of  the  Navier  Stokes  equations,  in  a space  time  domain  n x[0,  T] 
satisfying 

(2 . 3)  sup  { I u | 2 + | curl  u | 2 + | u | 2 } < N2  , 

(x,  t)e  nx[0,  T] 

where  N is  a given  a priori  bound.  Let  v be  the  kinematic  viscosity, 

V the  volume  of  £2,  and  let  u^x,  t),  u2(x,  t)  be  two  smooth  solutions  of 
the  Navier-Stokes  equations,  satisfying  (2.  3)  in  S2  X [0,  T],  and  such 
that, 

(2.4)  llu^-,  T)  - u2(-,T)||22  < 6 . 


It  is  shown  in  [12]  that  then,  for  0 < t < T , 


(2.5)  || u (* , t)  - u (•  ,t)||2  < (4N2)1_|a(t)  6^(t)  exp 

L (0) 


N (t-u(t)T)| 

2 
v 


where  |x(t)  is  given  by 


(2.6) 


H-(t)  = 


1 - exp 

2(N2+l)t 

u v 

1 - exp 

2(N2+1)T 

— V w 

The  estimate  (2.  5)  establishes  Holder -continuous  dependence  on  the  data 

and  implies  backwards  uniqueness  of  smooth  solutions.  On  the  other  hand, 

as  far  as  computing  the  solutions  are  concerned,  (2.  5)  is  rather  discon- 

-1  -50 

certing.  For  example,  if  V = T = N = 1,  v = 10  , and  6=10  , we  have  from 

(2. 5)  at  t = T/2  = 1/2, 


(2.7)  ||  u (- , 1/2)  - U (-,1/2)11*,  < 4e5°(10"5°)e  « 10 

L (£2) 


-20 


22 
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Moreover,  (2.  7)  is  little  changed  by  choosing  6 = 10  , since  the  rate 

at  which  jj.(t>  | 0 is  so  large,  even  at  such  small  Reynolds  numbers. 

It  is  not  known  whether  the  exponent  p(t)  is  sharp,  or  whether  constraints 
different  from  (2.  3),  possibly  involving  other  combinations  of  derivatives, 
would  result  in  a more  encouraging  estimate.  In  non-linear  problems,  a 
major  task  appears  to  be  that  of  isolating  classes  of  equations  for  which 
one  can  obtain  fairly  reasonable  Holder  estimates.  As  a very  small  be- 
ginning, the  one  dimensional  Burgers'  equation, 

(2.8)  u = vu  - uu  + f(x,  t),  0<x<L,  0 < t < T , 

L A-X  A. 


is  considered  in  [6].  It  is  straightforward  to  show  that  if  u^(x,  t), 


u2(x,  t),  are  two  solutions  satisfying 


(2.9)  MaxtluJ,  |u1)tl,  lui)ttl,  lu1(Xtl)<N,  1 = 1,2, 


for  (x,  t)  € [0,  L]  X[0,  T],  and  if 


(2.10) 

then  for  0 < t < T , 


lUjKT)  - u2(* , T)  II  2 < 6 , 
L 


T-t  t 
T , T 


(2.11)  || u, ( • , t)  - u (■ , t)  ||  < 2 K(t)  N 6 , 

1 2 L 


where 


4NL  + t(T-t){(NL)' 
4v 


+ (H3v)NL}j 


(2. 12)  K(t)  - exp 

In  [6],  extensive  numerical  experiments  are  presented  for  the  final  value 
problem  for  Burgers'  equation.  Some  of  these  results  will  be  described  in 
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:he  present  paper  later  on.  (See  Section  5).  One  of  the  points  suggested 
by  these  experiments  is  that  the  factor  K(t),  which  involves  the  Reynolds 
number,  appears  to  play  a significant  role  only  when  the  solutions  to 
Burgers'  equation  develop  steep  gradients  approaching  a "shock".  For 
more  reasonable  solutions,  one  finds  that,  even  in  single  precision,  i.  e. 
with  a unit  round-off  error  of  about  10"  , one  can  often  attain  significant 
accuracy  at  90%  of  the  way  back  from  T = 1 . Thus,  the  difficulty  of 
reconstructing  past  steep  gradients  from  future  smoothed  data  appears 
to  be  explained  by  the  factor  K(t)  in  (2.11).  Conceivably,  in  the  case 
of  the  Navier-Stokes  equations,  the  similar  exponential  factor  in  (2.  5) 
may  alone  suffice  to  account  for  the  difficulty  of  reconstructing  steep 
gradients,  and  in  some  suitable  norm,  there  might  well  exist  a Holder 
estimate  in  which  p(t)  is  independent  of  the  Reynolds  number,  and  decays 
linearly  with  t . 
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3.  The  backward  beam  equation  approach  in  self-adjoint  problems  with 
time  independent  coefficients 

We  shall  now  describe  a new  method, which  was  recently  developed 
in  [2],  for  computing  linear  self-adjoint  parabolic  equations  backwards 
in  time.  In  this  section  we  consider  the  case  where  the  spatial  operator, 

A,  is  independent  of  t . The  case  where  A depends  on  t is  more  subtle 
and  is  discussed  in  Section  4. 

2 

Let  f(x)  be  a given  function  in  L (0),  where  ft  is  a bounded  domain 
N N 

in  R , in  R with  a smooth  boundary  Bft  . Let  A be  a non- 

2 

negative  self-adjoint  operator  in  L (ft);  in  the  concrete  cases,  A 
is  the  unbounded  operator  corresponding  to  a self-adjoint  elliptic  boundary 
value  problem  in  ft,  with,  say,  zero  Dirichlet  data  on  Bft  . Given  the 
positive  constants  6,  M,  T,  we  consider  the  following  problem. 

Find  all  solutions  of 

(3. 1)  ut  = -Au,  0 < t < T , 
such  that 

(3.2)  II u ( * , T)  - f II  < 6 , 
and 

(3.  3)  II u ( • , 0)||  < M . ■ 

To  solve  this  stabilized  backwards  problem,  consider  the  following  device. 
Set 

1 M 

(3.4)  k=~log(y) 
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kt 

and  then  put  v = e u in  (3.1).  This  leads  to 
(3.  5)  v = -(A-k)v,  0 < t < T , 

(3.6)  II  v( * , T)  - ekTf||  < ekT6,  ||v(0)||  < M . 

Differentiating  (3.  5)  with  respect  to  t,  we  obtain  the  "backward  beam 
equation"  associated  with  (3.1),  namely, 

(3.7)  v = Bv  , 0 < t < T , 

2 

where  B * (A-k)  . Thus,  B is  a non-negative  self-adjoint  operator  in 

2 

L (ft);  in  particular,  B is  "m -accretive".  For  such  equations,  it  is  easy 

to  show  that  solutions  are  norm -convex.  We  have, 

,2 

(3.8)  — =-  llv(t)||  = 2 llv'(t)  ||  + 2 Re(Bv,  v)  > 0 . 
dt2 

In  particular,  if  v(t)  is  a solution  of  (3.7), 

(3.9)  II v(t) ||  < ^rl|v(0)||  + — llv(T) ||  . 

The  last  inequality  suggests  that  the  "initial -terminal  value"  or 
"two-point"  problem  is  well-posed  for  (3.7),  i.  e.  data  should  be  pre- 
scribed at  t = 0 and  at  t * T . Using  Hadamard's  classical  example  of 
the  Cauchy  problem  for  Laplace's  equation,  it  is  easily  shown  that  the 
initial-value  problem  is  in  general  ill-posed  for  (3.7).  It  also  follows 

from  the  spectral  representation  of  B,  that  solutions  to  the  two  point 

2 

problem  exist,  for  arbitrary  data  v(0),  v(T)  in  L (ft);  moreover,  as 
shown  in  [5],  (3.7)  has  the  "smoothing"  property.  That  is,  if  A has 
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sufficiently  smooth  coefficients,  arbitrarily  high  Sobolev  norms  of  the 

2 

solution  at  time  t,  0 < t < T,  can  be  estimated  in  terms  of  the  L 
norms  of  the  data  v(0),  v(T)  . In  a very  real  sense,  (3.  7)  is  an  "elliptic" 
equation  in  Hilbert  space. 

Let  w(-,t)  be  the  unique  solution  of 

(3.10)  w = Bw,  0<t<T, 

kT 

(3. 11)  w(0)  = 0 w(T)  = e f . 


We  then  have 
Theorem  3. 1 

Let  u(’,t)  be  any  solution  of  the  stabilized  backwards  problem 
(3.1)-(3.  3).  Let  k be  as  in  (3.4),  and  let  w(-,t)  be  the  unique  solution 
of  (3.10)-(3.11).  Then, 

T-t  t_ 

(3.12)  ||e'ktw(- , t)  - u(*  , t)||  < M T ST 

Moreover,  if  A has  smooth  coefficients,  N is  the  dimension  of  ft,  q 

N 

is  a positive  integer,  and  2 cr  > — + q,  there  is  a constant  K such  that 
for  0 < t < T , 


(3.13) 


Max  ||D^u(*,t)  - e ktD^w(* , t)  ||^  < 

lpl<q 


00  — 


log(^f) 

K((t)-ir  + (T-t)_or  + 


T-t  t_ 

T T 
M 6 
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Proof : 


kt 

If  u is  any  solution  of  (3. 1)  -(3.  3),  v = e u satisfies  (3.  7)  and 
the  inequalities  (3.6).  Let  z(- , t)  = v - w . Then,  from  (3.9)  and  (3.6), 
we  have, 

-kt  t k(T-t) 


(3.14)  ||e"ktz(.,t)||  < ~ Me 


+ T 6 


6 = M 


T-t  t 
T , T 


This  proves  (3.12).  The  proof  of  (3.13)  is  more  complicated  and  we  refer 
the  reader  to  [5]. 

Remark  1.  Since  the  estimate  (2.  2)  is  sharp,  for  the  difference  of  any 

^■Ict 

two  solutions  of  the  backwards  problem,  it  follows  that  e w above, 

is  a "be st -possible"  L approximation  to  any  solution  of  the  backwards 

problem.  Moreover,  even  though  the  data  f(x)  is  an  approximation  to 
2 “kt 

u(* , T)  in  the  L -norm,  e w approximates  the  solutions  of  the  back  - 

00 

wards  problem,  together  with  their  derivatives,  in  the  L norm,  on 
0 < t < T . 

Remark  2.  By  using  (3.12)  and  the  triangle  inequality,  we  obtain  an 

independent  proof  of  the  convexity  estimate,  (2.  2),  for  the  difference  of 

any  two  solutions  of  the  backwards  problem.  Similarly,  (3.13)  and  the 

triangle  inequality  lead  to  a maximum  norm  stability  estimate,  for  the 

2 

derivatives  of  any  two  solutions,  in  terms  of  the  L norm  of  the  data. 

In  actual  numerical  computation  of  the  solution  of  (3. 10) -(3. 11),  the 
time  variable  is  discretized  using  a centered  time  discretization.  With 
T = (N+l)at,  we  have, 


(3.15) 


n+1  n n-1 
w - 2w  + w _ n 

= Bw  , 

At 


n = 1,  2, 


, N , 


w - 0 


N+1  kT 
w = e f . 


(3.16) 
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This  system  of  linear  equations  can  be  written  in  tridiagonal  matrix 
form,  with  an  unbounded  operator  along  the  main  diagonal.  See  [3],  [4], 

[2],  Using  the  tridiagonal  algorithm,  one  obtains  the  existence  and 
uniqueness  of  solutions  to  (3.  15),  (3.16),  together  with  the  basic  norm- 
convexity  property  (3.  9),  for  the  solution  of  this  finite  difference  analog; 
the  proof  uses  only  the  "m-accretiveness"  of  B . Moreover,  this  method 
of  proof  provides  one  of  several  possible  algorithms  for  the  solution  of 
this  system  of  linear  equations.  To  discretize  the  spatial  operator  B , 
one  uses  finite  difference  analogs  of  the  elliptic  operator,  or  Galerkin 
methods  using  trial  functions  satisfying  the  boundary  conditions.  Either 
method  preserves  the  " m-accretiveness"  of  B.  Consequently,  the  fully  - 
discrete  scheme  is  unconditionally  stable  and  has  the  property  (3.  9). 
Finally,  estimates  such  as  (3.12),  (3.13),  supplemented  by  the  truncation 
error  at  time  t in  the  fully-discrete  scheme,  remain  valid  for  the  solution 
of  the  fully  discrete  problem.  We  refer  the  reader  to  [2]  for  a more  de- 
tailed discussion  of  these  matters.  By  and  large,  the  numerical  analysis 
of  (3.10),  (3.11),  is  very  similar  to  that  for  elliptic  boundary  value 
problems,  for  which  an  abundant  literature  exists.  Thus,  direct  or  iterative 
methods  may  be  used  to  solve  the  system  of  linear  equations.  In  [2],  a 
computational  example  is  discussed  in  detail  for  a one  dimensional  problem. 
Further  examples  will  be  given  later  in  the  present  paper,  when  we  discuss 
the  backwards  problem  for  Burgers'  equation.  (See  Section  5).  Finally, 
we  mention  some  successful  computations,  on  two-dimensional  problems 
in  rectangular  regions,  carried  out  by  B.  L.  Buzbee  at  the  Los  Alamos 
Scientific  Laboratories. 
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4.  Self-adjoint  problems  with  time  dependent  coefficients 


An  important  feature  of  the  backward  beam  equation  approach  is 
that  the  method  is  applicable  to  self-adjoint  parabolic  equations  with 
smooth  time  dependent  coefficients.  The  assumption  of  smooth  dependence 
on  time  is  important,  as  backwards  uniqueness  may  fail  for  non- smooth 
coefficients.  See  the  example  of  Miller  in  [14]. 

For  each  t > 0,  let  a(t;u,  v)  be  the  symmetric  bilinear  form  on 
H™(f2)  given  by 

(4. 1)  a(t;u,  v)  = V f a (x,  t)  Dqu  DPv  dx 

|p|,nq|<m  U p 


where  the  a depend  smoothly  on  x and  t,  and 


(4.2) 


We  assume  a(t;u,  v)  to  be  strongly  coercive  on  HQ  (0),  i,  e, 
there  exists  a positive  constant  w,  independent  of  t,  such  that 

(4.3)  a(t;V, V)  > « llvll^,  VV«Hq(J2)  , 


where  II  ||  denotes  the  Sobolev  norm.  Let  a(t;u,  v)  be  the  bilinear 

g 

form  obtained  from  a(t;u,  v)  by  replacing  a by  a = ~ a . The 

' ’ ’ 7 pq  ' pq  9t  pq 

form  a (t;u,  v)  will  play  an  important  role  in  the  subsequent  discussion, 

2 

Let  P(t)  be  the  unbounded  self-adjoint  operator  in  L (fi)  defined  by 
a(t;u,  v),  i.  e. , 
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2 

where  ( , ) is  the  scalar  product  in  L (£2)  . An  integration  by  parts 

shows  that  P(t)  corresponds  to  the  self-adjoint  elliptic  partial  differ- 
ential operator, 


(4.  5) 


v>  = l 


0'“'  u (-1)IPI  DP(a  (x,t)Dqu),  X€fl,  t>0  , 
lp|,|ql<m  pq 


together  with  the  Dirichlet  boundary  conditions, 


(4.6) 


D*u  = 0 on  I p I < m - 1,  t > 0 . 


We  shall  consider  the  parabolic  problem 

(4.7)  u = -P(t)u,  t > 0 . 

and  we  note  that  from  (4.6),  the  domain  of  P(t)  is  fixed  as  t varies. 
We  now  introduce  the  following  definitions. 

Definition 

The  parabolic  problem  (4.7)  is  minimal -smoothing  on  [0,  T]  if 

(4.8)  a (t;  v,  v)  < 0 We  V,  0 < t < T . 

It  is  strongly  - smoothing  if 

(4.9)  a (t;  v,  v)  < 2y  ||v||2,  y>0  . 

where  ||  ||  denotes  the  L norm.  We  say  that  (4.7)  is  maximal  - 


smoothing  if 


(4.10) 


a(t;  V,  v)  < a ||  V 11^, 


a > 0 . 


The  above  definition  distinguishes  three  broad  classes  of  problems. 
Further  refinements  are  clearly  possible.  In  the  interest  of  simplicity  of 
exposition,  these  refinements  are  not  considered  here.  It  will  be  seen 
later  that  the  maximal-smoothing  case  is  the  hardest  to  compute  back- 
wards in  time.  This  difficulty  is  not  a defect  of  our  method.  Rather,  it 
is  associated  with  the  type  of  Holder  estimate  which  obtains  in  that  case. 
Each  of  the  maximal -smoothing  and  strongly -smoothing  cases  can  be  re- 
duced to  the  minimal -smoothing  case  by  means  of  a preliminary  trans- 
formation. 

In  the  maximal -smoothing  case  this  reduction  is  accomplished  by 
stretching  the  time  variable.  With  u>  and  a the  constants  in  (4.  3)  and 

(4. 10) ,  define  the  function 

(4.11)  ijj(s)  = (-)  log(l+  «s/u),  s > 0 . 

a — 

Then,  ifi’(s)  > 0,  4j" (s)  < 0 and 

(4. 12)  «i|j"  + a(4/)2  = 0 . 


From  the  bilinear  form  a(t;u,  v)  in  (4.1),  we  construct  the  form  b(s;u,  v) 
where 


(4.13) 


b(s;u,  V)  = a(4*(s);u,  V)  i|i'(s),  s > 0 , 


/ 


b (x,  s)  dSj  D^v  dx 
pq 
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with 


(4. 14)  b (x,  s)  = a (x,  4»(s))  ^'(s),  s > 0 . 

Pm  P4 

Let  b'(s;u,  v)  be  the  symmetric  bilinear  form  on  H^O),  obtained  from 

9b 

(4.13)  by  replacing  b (x,  s)  by  — . We  then  have, 

(4.15)  b'(s; u,  v)  =!  4j"  a(^(s);u,  v)  + (i|i')2  a(ijj(s);u,  v)  . 

Hence,  using  (4.  3),  (4.10),  (4.12)  and  the  fact  that  4»"  < 0,  we  obtain 
from  (4. 15), 

(4.16)  b'(S;  v,  V)  < [a(i|j')2  + W^"]||v||^  = 0,  V v l H™(R)  . 

We  now  put  t ~ 4j(s)  in  the  parabolic  problem  (4.7).  Let 

(4.17)  |(x,  s)  * u(x,  4>(s)),  x < $7,  s>0  ; 

then,  £ satisfies  the  parabolic  problem, 

(4.18)  $s  = -G0(s)e,  xi  0,  s > 0 , 

(4. 19)  s0,  xi  80,  s > 0,  | p | < m - 1 , 

where  GQ(s)  = ^'(s)  Pq(4j(b))  . This  transformed  parabolic  problem  is  the 
one  generated  by  the  symmetric  bilinear  form  b(s;u,  v)  . Since  b'(s;v,  v)<0  , 
we  have  the  minimal -smoothing  ease  for  the  transformed  problem. 

In  the  strongly  smoothing  case,  we  put 

(4.20)  b(t;u,  v)  s a(t;  u,  v)  - 2 yt  J uvdx  , 

R 
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Then,  from  (4.9),  b (t;  v,  v)  < 0 . Next,  put 

2 

(4.21)  £(x,  t)  = e^  u(x,  t),  x e £2,  t > 0 . 

in  the  parabolic  problem  (4.7).  Then  £ satisfies, 

(4.22)  £t  = -G0(t)£,  x«  £2,  t > 0 , 

(4.2  3)  DP|  = 0,  xe  9!),  t > 0,  | p | < m - 1 , 

where  Gg(t)  = PQ(t)  - 2*yt  . This  is  the  problem  generated  by  the  bilinear 
form  b(t; u,  v).  in  (4.20).  Thus,  (4.21),  transforms  the  strongly  smoothing 
case  into  the  minimally  smoothing  case. 

For  the  purposes  of  the  following  discussion,  it  may  now  be  assumed 
without  loss  of  generality  that  the  parabolic  problem, 

(4.24)  ut  = -P(t)u,  t > 0 , 

2 

is  minimally  smoothing  on  the  interval  [0,  T]  . Given  f(x)  in  L (£2)  , 
and  the  positive  constants  6 , M,  T,  consider  the  following  problem.  Find 
all  solutions  of  (4.  24)  such  that 

(4.25)  II u(* , T)  - f II  < 6 

(4.26)  II u(* , 0)  ||  < M . 

As  in  Section  3,  we  put  k = •—  log(^)  and  v = e^u  in  (4.24),  to  obtain, 

1 6 

(4.27)  vt  = -(P(t)  - k)v,  0 < t < T . 

Differentiating  with  respect  to  t,  we  obtain 


475 


(4.28) 


vtt  = A(t)v, 


0 < t < T , 


2 

where  A(t)  is  an  unbounded  self-adjoint  operator  in  L (J2)  . See  [4, 

Section  3],  In  fact,  for  each  fixed  t > 0,  A(t)  is  the  unbounded  operator 
corresponding  to  the  following  elliptic  boundary  value  problem  of  order 
4m  in  S2: 

(4.29)  [(PQ(t)  - k)2  - PQ(t)]u  = 0,  x«  P, 

(4.30)  DPu  = DP[P0(t)u]  = 0,  xe9S2,  | p | < m - 1 ; 

where  P0(t)  is  the  differential  operator  obtained  from  PQ(t)  in  (4.  5)  by 
differentiating  the  coefficients  with  respect  to  t . Since  the  problem 
(4.24)  is  minimally -smoothing  by  hypothesis,  we  have, 

(4.  31)  < PQ(t)v,  v)  < 0,  Vv  € H™(J2)  . 

Consequently,  A(t)  in  (4.28)  is  a non -negative  self-adjoint  operator  for 
each  t . Note,  however,  that  even  though  (4.24)  is  a fixed  domain  parabolic 
problem,  the  backward  beam  equation  (4.28)  now  involves  a variable 
domain  operator,  A(t),  in  general.  See  [4,  Section  3].  Unlike  the  problem 
in  Section  3,  it  is  no  longer  possible  to  prove  existence  theorems  for  (4.  28) 
by  using  the  spectral  representation  of  A(t)  at  each  t.  Remarkably  enough, 
one  can  obtain  strong  results,  (i.  e.  existence  of  solutions  lying  in  the 
domain  of  A(t)  for  each  t ),  even  in  this  variable  domain  case,  for  the 
two-point  problem  assoc,  ^ted  with  the  finite  difference  analog  of  (4.  28). 

No  assumptions  need  be  made  about  the  manner  in  which  D.(t)  varies  with 
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t,  and  the  proof  uses  only  the  " m-accretiveness"  of  A(t)  for  each  t . 
Let  T = (N+1)A  t,  and  consider  the  system  of  difference  equations, 


(4.  32) 


n+1  „ n n-1 
w - 2w  + w „n  n 

-Aw  = 0, 

A t 


n = 1,  2,  . . . , N , 


0 N+1  u 

(4.  33)  w = a,  w = b , 

2 n 

where  a,  b e L (ft),  and  A =A(nAt)  . In  [3],  the  following  theorem  is 
proved. 

Theorem  4. 1 

There  exists  a unique  solution,  w(t),  in  (4.  32),  (4.  33),  with 

w(t)  e D (t),  for  each  t = nat,  n = 1,  2, . . . , N,  and  this  for  arbitrary 

2 

a,  b € L (ft)  . Moreover, 

(4.34)  llw(t)ll  <^IUII  + | (|bil  . 


The  proof  of  Theorem  4.1  given  in  [3]  is  constructive,  and  is  based 
on  the  tridiagonal  algorithm.  If  finite  element  methods  are  used  to  discretize 
A(t)  for  each  t,  such  methods  preserve  the  accretiveness  of  A(t)  . 
Consequently,  the  resulting  fully-discrete  scheme  also  has  the  norm  con- 
vexity property  (4.  34).  In  practice,  iterative  methods,  such  as  block 
relaxation  techniques,  may  be  used  to  solve  the  block  tridiagonal  system 
of  linear  equations.  See  [4]. 

We  now  turn  to  the  question  of  approximating  the  solutions  to  the 
parabolic  problem  (4.7),  backwards  in  time,  given  an  a priori  bound,  M, 
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2 

for  the  initial  data,  and  given  a function  f(x)  c L (ft),  such  that 

II  u(  - , T)  - f J|  <6  . If  (4.7)  is  minimally  - smoothing,  we  solve  the  system 

(4.  32)  with  the  two -point  conditions, 


(4.35) 


N+l  kT, 
w = e f , 


and 


(4.  36)  k = - log  j . 

We  then  define, 

(4.37)  uapp(t)  = e"kt  w(t) 


as  our  approximation  to  the  solutions  of  the  backwards  problem.  Thus, 
this  case  is  treated  in  exactly  the  same  way  as  the  problem  in  Section  3. 

If  (4.7)  is  strongly- smoothing , we  first  transform  to  the  minimal 
case  by  means  of  (4.  21),  and  solve  the  resulting  system  (4.  32)  for  the 
transformed  problem,  with  the  two -point  conditions, 


(4.  38) 


N+l  yT 
w = e ' 


where, 

(4.39)  k = ^ {logM  -log(evT  6)}  . 
We  then  define, 

(4.40)  u (t)  = e e ^ w(t) 

app 
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as  our  approximation  to  the  solutions  of  the  original  problem.  Finally, 
in  the  maximal -smoothing  case,  we  transform  to  the  stretched  variable, 
s,  as  in  (4.17).  Let  A s = S/N+l,  where 

(4.41)  S = (|)  [e“T/w  - 1]  . 


We  solve  the  system  (4.  32)  in  the  s-variable  with  the  two-point  conditions, 


(4.42) 


N+l 

w 


. . 1 . . M , 

and  k = - log(  — ) . 

Let 


(4.43) 


+-‘(t)  * ! [eat/w 


-1]  . 


As  our  approximation  to  the  solutions  of  the  original  problem,  we  define, 

(4.44)  uapp(t)  = exp[-k4j_1(t)]  w(4;‘1(t))  , 

where  w(s)  is  the  solution  of  (4.  32),  (4,42).  We  then  have  the  following 
result. 

Theorem  4.  2 

Let  u(t)  be  any  solution  of  the  stabilized  backwards  problem  for 
(4.7),  and  consider  uQpp(t)  ■ In  the  minimally  smoothing  case,  we  have, 

T-t  t_ 

(4.45)  ||u(t)  -u  (t)||  < M T 6T  + 0(At2)  . 

aPP  — 

In  the  strongly -smoothing  case, 
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(4  *6) 


T-t  t_ 

llu(t)  - u (t)||  < exp[vt(T-t)]  M T 6T  + 0(At2)  . 
app  — 

Finally,  in  the  maximal  smp  thing  case, 

(4.47)  llu(t)  - u (t)  ||  < M1_M,(t)  6^(t)  + 0(^t2)  , 

app  — 

where, 

at/u 

(4.48,  ' 

e - 1 

Proof 

The  proof  of  each  of  the  three  inequalities  follows  from  the  norm- 

convex  property  of  the  solution  of  the  backward  beam  equation  associated 

with  each  of  the  transformed  problems,  as  in  the  proof  of  (3. 12)  in 

Theorem  3. 1,  together  with  a subsequent  inverse  transformation  in  the 

2 

case  of  the  last  two  inequalities.  See  [2],  The  extra  term  0(at  ) 
represents  the  combined  spatial  and  time  discretization  errors,  as  we  are 
now  considering  the  fully  discrete  scheme  as  opposed  to  the  evolution 
equation  (4.28).  It  is  assumed  that  the  spatial  mesh  is  chosen  so  that 
the  spatial  truncation  error  is  of  the  same  magnitude  as  that  of  the  time 
discretization. 

Remark.  By  making  a t 0 in  the  above  error  estimates,  we  recover 
logarithmic  convexity  estimates  originally  obtained  by  Agmon-Nirenberg, 
[1].  Note  the  exponential  decay  to  zero  as  t i 0,  of  the  exponent  |j.(t) 
in  (4.47).  An  example  of  the  maximal -smoothing  case  is  provided  by  a 
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simple  diffusion  equation  in  which  the  diffusion  coefficient  grows  with 
time.  The  strongly  smoothing  case  corresponds  to  a diffusion  equation 
with  a growing  zero  order  term.  The  minimal  case  corresponds  to  a 
constant  or  decaying  diffusion  coefficient.  By  considering  a diffusion 
coefficient  dependending  only  on  t,  one  can  show  that  (4.47)  is  sharp. 

In  such  a problem,  considerably  more  precision  in  measurement  is  necessary 
at  time  T,  in  order  to  attain  significant  accuracy  backwards  in  time,  as 
compared  with  the  other  two  cases. 
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5.  Computing  small  solutions  of  Burgers’  equation  backwards  in  time 
We  shall  now  describe  some  recent  results  dealing  with  the  ap- 
plication of  the  backward  beam  method  in  the  computation  of  the  final 
value  problem  for  Burgers'  equation.  The  reader  is  referred  to  [6],  for 
proofs  and  a more  detailed  discussion  of  the  main  results.  We  consider 
the  following  initial  boundary  value  problem  for  the  one  dimensional 
Burgers'  equation, 

(5.1)  uts  vuxx'uux  + f(x>t)>  0 < x < L,  0<t<T  , 

(5.2)  u(0,  t)  = u(L,  t)  = 0,  t > 0 , 

(5.3)  u(x,  0)  = a(x),  0<x<L  , 

where  a(x)  and  f(x,  t)  are  sufficiently  smooth  that  the  (unique)  solution 
of  (5.1)-(5.3)  has  sufficiently  many  derivatives  on  [0,  T].  Let  A be  the 
positive  self-adjoint  operator  corresponding  to  -u"  with  zero  boundary 
conditions,  and  let 

(5.4)  F(u)  = -uux  . 

We  may  then  write  (5. 1) -( 5.  3)  in  the  form  of  an  evolution  equation  in 
L2[0,L],  viz, 

(5.  5)  u = -vAu  + F(u)  + f(t),  0 < t < T , 

(5.  6)  u(0)  = a . 
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One  way  of  solving  the  forward  problem  is  by  means  of  the 


following  iterative  procedure, 

(5.7)  u®  = -vAu°  + f(t),  0 < t < T , 

(5.8)  u°(0)  = a , 
and  for  each  m = 1,  2,  3,  . . . , 

(5.  9)  u“  = -vAum  + F(um_1)  + f(t),  0 < t < T , 

(5. 10)  um(0)  = a . 

In  fact,  this  procedure  was  used  by  Kato  and  Fujita  in  [10],  [11],  as  a 
means  of  proving  existence  and  uniqueness  theorems  for  the  Navier-Stokes 
equations,  which  they  viewed  as  an  initial  value  problem  in  Hilbert  space. 
An  important  feature  of  the  above  iteration  is  that  one  proceeds  through  a 
sequence  of  inhomogeneous  linear  parabolic  problems  with  constant  co- 
efficients. Concerning  the  convergence  of  this  iterative  process,  we  have 
the  following  theorem.  See  [6]. 

Theorem  5. 1 

I I „ l „ 

Let  a(x)  belong  to  D(A2),  and  let  f(t)  c D(A2)  with  ||A2f(t)||  « 

L^O,  T]  . Let 

(5.11)  ( ■fttJ'.T. ) [ ||  A2a  ||  + f ||  A2  f(s)  ||ds]  < 1 ; 

v 0 

1 

i m 64  LT  ^ 

then  A2u  exists  on  [0,  T]  for  every  m . Let  9 = 1 - [1  - ( — ) 

III  Atu°  III  ]T  . Then  0 < 9 < 1,  and 
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(5.12) 


1 

2 


In  the  above  statement  of  the  theorem,  lllv|||  denotes  Sup  ||v(t)||,  and 
i ? L _ 0<t<T 

II  A2  v(t)  ||  = f |v  (x,  t)  | dx  . 

0 x 

The  condition  (5. 11)  is  sufficient  for  convergence  in  the  norm 

III  HI  . In  practice,  convergence  may  occur  even  if  (5.11)  is  severely 
violated,  by  as  much  as  a factor  of  1000  in  some  cases.  See  [6]  for 
several  numerical  examples.  On  the  other  hand,  (5. 11)  cannot  be  too 
severely  violated  for  convergence  to  occur  on  [0,T]  . In  general,  the 
convergence  of  (5.  7) -(5. 10)  is  only  local  in  time,  even  if  a unique  smooth 
solution  exists  for  all  t > 0 . An  example  of  divergence,  except  for  t 
sufficiently  small,  is  given  in  [6].  In  that  example,  the  exact  solution  is 
known  and  is  found  to  develop  steep  gradients,  i.  e.  a smooth  approxima- 
tion to  a "shock"  evolves  from  the  initial  data.  Convergence  occurs  before 
the  gradients  become  too  steep.  See  also  Example  2 and  Figure  1 below. 

The  essential  idea  behind  the  algorithm  for  the  backwards  problem, 
is  to  solve  each  linear  parabolic  problem  in  the  Kato-Fujita  sequence  of 
iterates,  backwards  in  time,  using  the  value  of  the  solution  to  (5.  5),  (5.  6) 
at  time  T . Each  such  linear  backwards  problem  is  solved  via  the  backward 
beam  equation  discussed  in  Section  3.  Since  each  linear  problem  has 


484 


constant  coefficients,  the  "Fourier  Method"  of  Kreiss -Oliger  can  be  used 
to  great  advantage.  That  is,  one  calculates  the  spatial  derivatives  at 
the  mesh  points,  by  differentiating  the  trigonometric  polynomial  which 
interpolates  the  function  values  at  equally  spaced  grid  points,  using  at 
least  2 points  per  significant  wave  length.  If  the  data,  inhomogeneous 
term,  and  solution  have  smooth  periodic  extensions,  this  technique  for 
differentiation  is  highly  accurate  as  shown  in  [13].  It  is  also  extremely 
attractive  in  that  one  can  make  use  of  Fast  Fourier  Transform  algorithms. 
(Such  a method  has  been  employed  by  Orszag  in  [15],  for  the  Navier- 
Stokes  equations.)  Finally,  with  this  particular  technique  for  discretizing 
the  space  variable,  the  algebraic  problem  of  inverting  the  "block  tri- 
diagonal"  matrix  in  (3.15)  becomes  trivial.  One  has  a scalar  positive 
definite  tridiagonal  matrix  to  invert  for  each  Fourier  component  in  turn, 
and  such  matrices  can  be  efficiently  inverted  by  a standard  algorithm.  It 
should  be  remarked  that  the  high  accuracy  in  spatial  discretization  is 
quite  important  in  the  present  equation,  where  there  may  be  high  frequency 
components  in  the  solution  at  positive  times,  even  if  such  components  are 
absent  in  the  initial  data.  This  is  a basic  property  of  the  homogeneous 
equation.  See  the  classic  paper  by  Julian  Cole  in  [7]. 

To  motivate  our  main  result,  consider  the  following  "thought  ex- 
periment". Let  {um(t)}^_0  ke  tlie  sequence  of  iterates  in  the  forward 
problem,  and  let  um(T)  - bm  . Imagine  that  each  b^(x)  is  known  ap- 
proximately. Let  be  the  approximate  value  of  b^,  and  suppose 
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(5.14) 


IU2(bm  - b m)  1)  < 6,  m = 0,1,2,  ; 

Suppose  further  that  (5. 11)  is  satisfied  for  the  forward  iteration.  Then, 
from  (5.12),  we  have 

(5.15)  |||A}um|||<  M=0(^-t)T  . 

We  may  then  pose  the  following  linear  backwards  parabolic  problem  for 
each  iterate  um(t); 

Find  all  solutions  of 

(5.16)  u“  = -vAum  + F(um_1)  + f(t),  0 < t < T , 

such  that 

(5.17)  ||  A^(b  - um(T))||  < 6 

m —* 

JL  m 

(5.18)  ||  A3u  (0)||  < M . 

Using  the  backward  beam  equation  with  k = — log(^  ),  each  um(t)  may 
then  be  approximately  solved  backwards  in  time,  and  used  to  generate 
a new  approximate  inhomogeneous  term  for  the  next  iteration.  Because 
of  the  "destruction  of  information"  as  t l 0,  it  is  clear  that  sizable 
errors  will  be  generated  at  t = 0 and  passed  on  to  the  next  successive 
iterate.  The  next  theorem  assesses  the  accumulated  error  after  m steps 
of  this  process.  See  [6]  for  the  proof. 

Theorem  5.  2 

Let  (5.11)  be  satisfied.  Let  wm(t)  be  the  sequence  of  successive 

approximations  obtained  via  the  backward  beam  equation.  Let  u(t)  be 
the  solution  of  (5.  5),  (5.  6).  Then  there  exists  a positive  constant  , 
depending  only  on  m,  such  that 
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T-t  t_ 

(5.19)  HaVV)  - A*u(t)ll  < Cm[log(^)]  mM  T 6 T + 

, „ „m-3  1 

where  B = 2 + — . 

m 2 

Remark . It  follows  from  (5. 19)  that  given  any  e > 0,  one  can  make 
||A^wm(t)  - A^u(t)  ||  < e,  uniformly  on  compact  subintervals  of  (0,  T]  , 
by  choosing  6 sufficiently  small  and  m sufficiently  large.  On  the 
other  hand,  even  though  (5. 11)  is  satisfied  so  that  the  forward  iterates, 
um(t),  converge  on  [0,T],  the  above  inequality  does  not  imply  convergence, 
for  fixed  6 > 0,  as  m -*  oo  . 

The  above  theorem  is  not  strictly  applicable  to  the  algorithm  which 

is  used  in  practice.  In  the  first  place,  (5. 11)  may  be  severely  violated. 

In  the  second  place,  the  functions  b (x)  are  not  available.  Rather,  one 
* m ' ’ 

has  an  approximation  b (x)  to  the  terminal  value  u(x,  T),  where  u is 
the  solution  of  the  non-linear  problem  ( b.  1) - ( 5.  3),  and, 

(5.20)  ||a*u(-,T)  - A^  b ||  < 6 . 

From  physical  or  other  considerations,  one  may  obtain  an  a priori  bound, 

(5.21)  III A^u  HI  < M . 

~ 1 M 

Using  the  given  data  b(x)  and  setting  k = — log(^-  ),  each  inhomogeneous 
linear  parabolic  problem  is  then  solved  backwards  with  the  backward  beam 
equation.  In  an  extensive  series  of  numerical  experiments,  with  problems 
for  which  exact  solutions  are  known,  it  is  then  found  that  Theorem  5.  2 is 
qualitatively  correct  as  far  as  the  algorithm  which  is  used  in  practice  is 
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concerned.  Thus,  the  solutions  are  well  approximated  after  a relatively 
small  number  of  iterations.  Further  iterations  may  lead  to  rapid  divergence, 


even  though  the  forward  iteration  converges.  In  many  cases,  one  observes 
considerable  improvement  before  the  onset  of  divergence.  The  situation 
is  analogous  to  that  of  the  divergent  power  series  in  the  theory  of 
asymptotic  expansions,  where  the  first  few  terms  often  provide  excellent 
approximations.  Furthermore,  in  several  experiements,  the  distance  back 
into  the  past  where  significant  accuracy  can  be  attained,  is  greater  than 
might  be  expected  from  the  a priori  stability  estimate  for  Burgers'  equa- 
tion given  in  Section  2.  We  shall  now  give  several  numerical  examples. 
Example  1 

Consider 

(5.  22)  U=vu  - uu  , 0 < x < ir,  0 < t < 1 , 

' t xx  x’  ’ * 

(5.23)  u(0,  t)  = u(ir,  t)  = 0,  0<t<l  , 

(5.24)  u(x,  0)  = uQ  Sinx,  0 5 x £ n ■ 


For  any  positive  v and  any  uQ,  the  exact  solution  of  this  problem  was  ob- 
tained by  Cole  in  [7].  It  is  given  by 


(5.  25) 


u(x,  t) 


4v  £ 
0=1 


00  2 u 

-vn  t T . 0 . 0, 
e n I (— ) Sin  nx 

n Zv 


00  ^ 

0 . „ -vn  t 0 _ 

y2v>c°snx 

n=l 


where  I (z) 
n 


is  the  modified  Bessel  function  of  the  first  kind.  From  (5.25), 


we  observe  the  effect  of  non-linearity,  in  that  the  initial  pure  sine  wave 
evolves  into  a periodic  function  in  which  all  frequencies  are  present.  It 
is  instructive  to  associate  a Reynolds  number  with  the  above  problem. 
Following  Cole,  [7],  we  define 

V 

(5.26)  Re  = — — . 


In  this  example  we  chose  uQ  = 1 and  v = . 0025,  so  that  Re  = 126  . As 
far  as  the  sufficient  condition  for  convergence  of  the  forward  iteration  is 
concerned,  we  actually  have,  in  the  present  case, 


(5.27) 


64  LT 
v 


The  expression  (5.25)  was  evaluated  at  64  equally  spaced  mesh  points 
on  [0,  2ir],  at  T = 1,  to  generate  the  terminal  data.  The  backward  beam 
method  was  then  used  with  At  - and  with  Fourier  techniques  to 

discretize  the  space  variable,  using  64  equally  spaced  points  on  the 
period  interval  [0,  2tt]  . The  computations  were  performed  in  single  pre- 
cision on  UNIVAC  equipment  at  the  University  of  Wisconsin.  Thus,  the 

- 8 

unit  round-off  error  is  of  the  order  of  10  . Using  M = . 1,  we  then 

M 

obtain  k = log(  — ) ~ 18.  7 . 

Since  the  exact  solution  is  known,  a comparison  of  the  computed 

solution  with  the  exact  solution  was  made  after  each  successive  iteration. 

At  each  time  t = nAt,  n = 1,  2,  . . . , N,  the  relative  error  in  the  discrete 
2 

spatial  L norm,  was  computed  after  each  iteration,  to  observe  the 
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behavior  of  the  backwards  iteration.  This  error  is  tabulated  in  Table  1, 
for  the  first  six  iterations,  for  17  values  of  t lying  between  zero  and 
1 , Little  change  in  the  relative  error  pattern  appears  after  the  fourth 
iteration.  Despite  the  moderately  large  Reynolds  number  of  126  in  the 
present  example,  and  the  appearance  of  the  exponential  factor  exp[Re] 
in  the  Holder  estimate  for  Burgers'  equation,  we  see  that  even  with  6 of 
the  order  of  . 1 x 10"  = 10  , a relative  error  of  less  than  10%  is  achieved 

as  far  as  93%  of  the  way  back  from  T = 1 . In  the  above  example,  the 
exact  solution  does  not  develop  steep  gradients  within  the  time  interval 
[0, 1] . Moreover,  the  forward  iteration  converges,  although  (5. 11)  is 
violated. 

Example  2 

The  problem  is  again  (5.  22) -(5. 24)  but  with  u^  =*  40  and  v = 1 . 

As  in  the  previous  example, 

(5.28)  Re  s 126  . 

However,  in  lieu  of  (5, 11),  we  now  have 

(5.29)  (£ii!)  || a* a ||  * 711  . 

In  this  example,  the  forward  iteration  diverges  for  t > , 04  . An  independent 
evaluation  of  the  exact  solution,  using  (5.2  5),  reveals  that  the  initial 
sine  wave  develops  steep  gradients  almost  immediately.  See  Figure  1 . 

A smooth  approximation  to  a "shock"  evolves,  broadens,  and  then  dies 
out.  Attempts  were  made  to  compute  this  problem  backwards  in  time,  in 
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TABLE  1. 


2 

Relative  error  in  the  L norm,  as  a function  of  time  and 
number  of  iterations,  in  the  backwards  computation  of  Example  1 


S\No  of 
\ltems 

timen^ 

1 

2 

3 

4 

5 

6 

. 0332 

. 29+00 

. 29+00 

. 29+00 

. 29+00 

. 29+00 

. 29+00 

♦ 0664 

. 95 -01 

. 89-01 

. 85-01 

. 84-01 

.84-01 

.84-01 

. 0997 

.54-01 

.43-01 

. 35-01 

. 30-01 

. 30-01 

. 30-01 

. 1661 

.46-01 

. 34-01 

. 26-01 

.21-01 

.21-01 

.22-01 

. 2326 

.42-01 

. 28-01 

. 22-01 

. 19-01 

. 19-01 

. 19-01 

. 2990 

.37-01 

. 22-01 

. 17-01 

. 15-01 

. 16-01 

. 16-01 

. 3654 

.34-01 

.21-01 

. 17-01 

. 16-01 

. 17-01 

. 17-01 

.4319 

.29-01 

. 16-01 

.14-01 

. 13-01 

. 13-01 

. 13-01 

.4983 

. 27-01 

. 11-01 

. 84-02 

. 80-02 

. 81-02 

. 81-02 

. 5648 

. 23-01 

. 93-02 

. 79-02 

. 77-02 

. 78-02 

. 78-02 

. 6312 

. 18-01 

. 75-02 

. 66-02 

. 65-02 

. 65-02 

. 65-02 

. 6977 

. 17-01 

. 13-01 

. 12-01 

.12-01 

. 12-01 

. 12-01 

. 7641 

. 12-01 

. 29-02 

. 26-02 

. 26-02 

. 26-02 

. 26-02 

. 8306 

. 82-02 

.40-02 

. 39-02 

. 39-02 

. 39-02 

. 39-02 

. 8970 

. 83-02 

. 75-02 

.75-02 

. 75-02 

. 75-02 

. 75-02 

. 9634 

.45-02 

.47-02 

.47-02 

. 47-02 

. 47-02 

.47-02 

.9967 

.32-02 

. 32-02 

. 32-02 

.32-02 

. 32-02 

. 32-02 
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SQL  Lit  ION  PT  TIME  ,016S 
( — = INITIAL  OPT R ) 


SOLUTION  PT  T I ME  -G498 
( — = INITIPL  CfiTP  ) 


SOLUTION  PT  TIME  .0332 
( ---  = INITIAL  QRTP  I 


SOLUTION  AT  TIME  . M9S 
I ---  - INITIAL  ORTfi  J 


SOLUTION  RT  TIME  ,3322 
( ---  = INITIHL  ORTfi  I 


SOLUTION  RT  TIME  .7309 
t — = INITIAL  ORTA  ) 


Figure  1. 


Development  of  steep  gradients  in  the  exact  solution 
of  the  problem  in  Example  2. 
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single  precision,  starting  from  a time  sufficiently  close  to  zero,  that 
the  "shock"  would  still  be  clearly  defined  in  the  terminal  data.  This 
amounts  to  reconstructing  steep  gradients  after  they  have  been  smoothed, 
and  clearly  requires  considerable  precision  in  measurement.  No  measurable 
success  was  achieved  in  this  experiement. 

Although  Example  1 has  the  same  Reynolds  number  as  the  present 
example,  it  appears  that  the  exponential  factor  in  the  stability  estimate 
plays  a much  more  important  role  in  the  present  problem. 

Example  3 

We  now  consider  an  inhomogeneous  problem, 

2 

(5.  30)  u = vu  - uu  + 9TTe  Sin4rrx,  0 < x < 1 , 

' t xx  x ’ * 

0 < t < 1 , 

(5.  31)  u(0,t)  = u(l,t)  = 0,  t > 0 , 

(5.32)  u(x,  0)  = 3 Sin  2ttx,  0<x<1  • 

The  exact  solution  is 

2 

(5.33)  u(x.t)  = 3e  ^ vtSin2Trx  . 

We  chose  v = 3/14  so  that  Re  = 14  in  this  experiement.  Note 
however  that  in  lieu  of  (5. 11),  we  have 

i t 

(5.34)  (^V  [ II  A^a||  + f ||  f(s)  ||ds]  » 487  . 

V 0 
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Convergence  of  the  forward  iteration  occurs  at  this  and  even  higher  values 
of  (5.  34)  in  this  type  of  example.  However,  even  here,  where  the  solu- 
tion does  not  develop  steep  gradients,  divergence  of  the  forward  iteration 
occurs  when  v = . 05,  in  which  case  (5.  34)  has  a value  of  2700  . See 
[6],  The  importance  of  the  value  of  (5.  34),  rather  than  the  Reynolds 
number,  is  quite  apparent  in  the  behavior  of  either  the  forward  or  back- 
wards iterations,  and  this  remains  valid  in  all  our  experiments. 

In  the  backwards  computation  of  this  problem,  the  relative  error  at 
93%  of  the  way  back  from  T = 1,  was  found  to  be  of  the  order  of  500% 
after  the  first  iteration!  This  error  was  then  reduced  to  less  than  10% 
after  six  iterations.  At  77%  of  the  way  back  from  T - 1,  the  initial 
relative  error  of  110%  was  reduced  to  less  than  . 3%  after  six  iterations. 

In  Figure  2,  the  first  seven  iterates  are  plotted,  together  with  the  exact 
solution,  at  t = .0664,  the  93%  value.  In  Figure  3,  the  iterates  at  77% 
of  the  way  back  are  depicted.  Again  the  influence  of  the  exponential 
factor  in  the  convexity  estimate,  does  not  seem  to  be  present  in  this 
single  precision  computation.  Indeed,  the  accuracy  which  one  can  achieve 
at  such  long  distances  into  the  past  is  very  encouraging. 

4.  An  example  of  "asymptotic"  convergence 

In  Example  3,  the  value  of  487  in  (5.  34)  is  somewhat  of  a critical 
value  insofar  as  observing  the  divergence  phenomenon  suggested  by 
Theorem  5.  2,  in  less  than  eight  or  nine  iterations.  We  consider  now  the 
problem  in  Example  3 with  a slightly  lower  value  of  v,  v = 3/14.  256789. 
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Figure  2. 


Convergence  of  the  iteration  at  93%  of  the  way  back  from 
T = 1,  in  the  computation  of  Example  3 backwards  in  time. 
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Figure  3.  Convergence  of  the  iteration  at  77%  of  the  way  back  from 
T = 1,  in  the  computation  of  Example  3 backwards  in  time 


We  then  have, 


1 «p 

(5.35)  (^1H)2  [ || A*a ]|  + f ||A^f(s) lids]  = 496  . 

v 0 

In  Figure  4,  the  first  seven  iterates  at  77%  of  the  way  back  from  T = 1 
are  depicted.  The  third  iteration  gives  the  closest  agreement.  A more 
detailed  discussion  of  this  and  other  features  of  the  algorithm  is  given  in 
[6].  The  forward  iteration  converges  in  this  example. 
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Figure  4 . 


Asymptotic  convergence  phenomenon  at  77%  of  the  way  back, 
from  T = 1,  in  the  computation  of  Example  4 backwards  in  time. 
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SPECIAL  SOLUTIONS  OF  THE  ONE-DIMENSIONAL  PARABOLIC  EQUATION 


Siegfried  H,  Lehnigk 
Physical  Sciences  Directorate 

US  Army  Missile  Research,  Development  and  Engineering  Laboratory 

US  Army  Missile  Command 
Redstone  Arsenal,  AL  35809 


ABSTRACT,  From  a mathematical  point  of  view,  two  classes  of  con- 
servative diffusion  processes  are  discussed  which  can  be  described  by 
means  of  a similarity  variable  which  depends  linearly  on  the  original 
space  variable. 

1.  INTRODUCTION.  We  are  interested  in  finding  solutions  Q(x,t) 
of  the  parabolic  equation 

(1.1)  A(x)  Qxx  + B(x)  Qx  + C(x)  Q = Qt 

in  the  domain  D:  x>0,  t>0,with  the  property 

1 

Q(x,  t)  eL  [0,®)  Vt  > 0 

and,  in  particular, 


0 s 


Q(x, t)  ux  3 const < 


Op  # 


Such  solutions  are  called  conservative. 

The  problem  of  conservative  solutions  has  a long  history.  We  give 
three  typical  references:  Fourier  [1],  Doetsch  [2],  and  Feller  [3], 

We  assume  that  A,B,C  e C(Q,®).  Equation  (1.1)  is  a Fokker-Planck 
equation  if  and  only  if  C " B’  - A".  In  this  case,  it  can  be  written  in 
the  form 


caq]xx  - mx  - Qt 


with  B ■ 2A'  - B. 

Let  us  consider  two  examples: 

Ex.  1 . (1.1)  with  A = aoc,  B = + p2x»  C ”^2*  o'  > 2®^  (Fokker- 

Planck).  This  equation  has  been  thoroughly  investigated  6y  Feller  [3] » 
In  D it  has  the  particular  solution 
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3,  TWO. .SKIS  OF  COEFFICIENT  FUNCTIONS.  We  now  give  two  sets  of 
coefficient  functions  A(x) , B(x),  and  C(x)  for  equation  (2.1)  which  sat- 
isfy the  conditions  (2.6)  and  (2,8).  Proofs  and  all  details  will  be 

rvra  . on  j-o/j o 1 oankora 


(3.7) 


Q(x,t)  = b"l(t)  l 1(exp  - |a‘1p1  log2  ?;) 

X j^Cl  1F1  ^2  (2  " CT)’2:2  l0g2  tj 

1F1^2  ^ "CT^’2’2a  ^1  F°g 


log  ? 


+ C 


C ^ 2 = const 


, 5 = xb“1(t) 


where  b(t)  is  determined  by  (3.6)  under  the  initial  conditions  t = 0, 
bQ  > 0,  with  n = 3or  - p , and  0 


/» 


Q(x,  t)  dx  = 


0 if  a > 0 

^2  1^  if  <7  = 1 (Fokker-Planck) 


4.  INITIAL  AND  BOUNDARY  BEHAVIOR.  It  is  appropriate  to  conclude 
with  a brief  remark  on  the  initial  and  boundary  behavior  of  the  solutions 
covered  by  Theorems  A and  B. 

In  general  diffusion  processes  in  the  domain  x > 0,  t > 0,  it  does 
not  make  sense  to  approach  the  origin  (0,0)  because  of  possible  disconti- 
nuities in  the  initial  and  boundary  behavior  of  solutions  at  that  point. 
Therefore,  one  has  to  consider  the  general  initial -bound ary  value  problem 
£2]  with  perpendicular  approach  to  the  boundaries  of  the  domain  x > 0, 
t > 0,  which  excludes  the  approach  to  (0,0).  From  this  point  of  view,  a 
solution  Q(x,t)  of  (2.1)  is  called  singular  if 

Q(x,t)  4 0 as  t 1 0 for  fixed  xe (0,“) 


and 


Q(x, t)  4 0 as  x I 0 for  fixed  te (0,“)  . 

This  terminology  is  due  to  Doetsch  [5]  (see  also  [2]). 

The  conservative  solutions  (3.3)  of  (2.1)  with  coefficients  (3.1) 
are  singular  if  bQ  = 0 and  if  either 

1)  \ < 1 , l + \-  a^>0  , 
or 


2)  X > 1 


The  existence  of  singular  solutions  is  rather  disturbing  within  the 
framework  of  diffusion  theory  and  its  applications  since  they  introduce 
nonuniqueness  of  the  general  initial-boundary  value  problem. 

The  solutions  (3.7)  of  equation  (2.1)  with  coefficients  (3.5)  are  not 
singular  since  bQ  is  positive.  They  do  not  go  to  0 as  t ; 0. 

REFERENCES 

1.  J.  B.  J.  Fourier,  Th^orie  analytique  dela  Chaleur,  OEuvres  de  Fourier, 
vol,  1,  Gauthier-Villars,  Paris  (1888). 

2.  G.  Doetsch,  Handbuch  der  Laplace-Transformation,  vol.  3,  Birkhauser, 
Basel  (1956). 

3.  W.  Feller,  Ann.  Math.  54,  173  (1951). 

4.  S.  Kepinski,  Math.  Ann.  61,  397  (1905). 

5.  G.  Doetsch,  Math.  Z.  22,  293  (1925). 


509 


INTEGRATION  OF  /“  F(x) JQ(«Xx) J^bxjdx 
Shunsuke  Takagi 

U.S.  Army  Cold  Regions  Research  And 
Engineering  Laboratory,  Hanover,  NK 

INTRODUCTION 

Infinite  integrals  involving  Bessel  functions  under  the  integral 
sign  are  of  extreme  importance  in  many  branches  of  mathematical  physics. 
We  encountered  several  types  of  the  integrals  of  the  form 

oo 

f F(m)  J^(bx)  dm  (l) 

JQ 

in  the  study  of  the  viscoelastic  deformation  of  an  ice  plate  floating 
on  water  (Ref.  1). 

When  F(m)  is  an  even  function  and  has  only  algebraic  singularities 
(i.e.  poles  and  branch  points),  the  integral  (l)  can  be  transformed, 
as  shown  later,  to  a contour  integral.  Therefore,  when  poles  only 
constitute  the  singularities,  the  application  of  the  residue  theorem 
enables  us  to  integrate  it. 

There  are  many  other  more  complicated  integrals  that  cannot  be 
simply  integrated  by  use  of  the  residue  theorem*  A simple  example  is 
a variant  of  (l)  where  F(x)  has  a branch  point,  A more  complicated 
example  is  the  one  where  F(x)  has  an  essential  singularity;  the  appli- 
cation of  the  contour  integral  is  out  of  the  question  in  this  case. 

We  may  hope,  however,  that  all  these  difficulties  can  be  resolved  by 
use  of  Barnes1  integral  representations, 

Watson  states  that  f,the  Bessel  function  under  the  integral  sign 
may  be  replaced  by  the  contour  integral  of  Barnes1  type  involving 
Gamma  functions,  and  the  order  of  the  integration  is  then  changed;  this 
very  powerful  method  has  not  previously  been  investigated  in  a systematic 
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manner"  (Ref*  2>  p,  383)*  Watson  (Ref.  2),  however,  does  not  develop 
this  method  to  cover  all  the  needs  arising  from  practical  application. 
He  rather  imposes  severe  restrictions,  as  shown  later,  to  avoid  am- 
biguous applications. 

It  is  shown  in  this  paper  that  his  restrictions  may  be  removed. 
However,  as  shown  later,  different  forms  of  Barnes7  representations  do 
not  necessarily  yield  one  and  the  same  result.  Watson’s  restrictions 
extricate  us  from  this  trouble,  although  they  prohibit  the  application 
of  this  method  to  a majority  of  the  interesting  cases. 

The  objective  of  this  paper  is  to  present  the  difficulties  we  have 
encountered,  and  to  solicit  theoretical  mathematicians  to  solve  them. 

It  is  my  hope  that  the  mysteries  enshrouding  the  application  of  the 
Barnes’  representations  may  be  lifted  in  the  near  future  and  unreserved 
use  of  them  will  be  guaranteed. 


CONTOUR  INTEGRAL 


The  following  theorem  gives  the  conditions  that  enable  us  to 
transform  (l)  to  a contour  integral. 

Theorem:  Let  F(a)  be  an  even  function  of  z that  has  only  algebraic 
singularities  (i.e,  poles  and  branch  points)  in  the  upper  half  plane  of 
the  complex  variable  z = x + iy , Then, 


co 

/ 


F(<c) 


J-j {ax) 


J ( bx ) 
o 


dx 


a) 


00 

77  J Q(bz)  dz  , when  a>b 


(2) 
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i — ll  OJ 


(3) 


J^cli 


o 


) d z 


when  a<b 


CO 

where  ^ means  the  integral  passing  through  the  contour  shown  in  Figure 

_oo 

1-  The  value  at  a-b  is  given  by  the  average  of  the  limits  approached 


Figure  1-  The  contour  of  integration  of  (2)  and  (3). 


Proof:  Use  of  the  relations 


and 


shows  that 


- »<2)U) 


J (-3;  = J (s) 

O O 


j F(s)  H^has)  J±(bz)  dz  = I j f(z)  I H^1\az)  + H.[2) 


(as) 


J (2)3  ) d 3 

(a) 


The  right  hand  side  of  (a)  is  equal  to  (l). 
Use  of  the  asymptotic  formulas 


H 


1>(ag)^4 


2 i[az  - ( 3tf/U ) ] 
ir  az 


and 


Jo(Z>2h/S?  cos<fe  -v> 
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shows  that  the  left  hand  side  of  (a)  is  equal  to  (2).  The  case  a>b 
is  thus  proved.  The  case  a<b  can  he  similarly  proved. 

The  value  at  a=b  may  be  computed  by  use  of  Barnes'  theory  as  done 
by  Watson  (see  Ref.  2,  p,  ^02),  But  this  case  is  not  essential  to 
our  analysis;  we  may  not  try  to  go  through  the  complete  analysis  of 
this  case  in  the  present  paper. 


Example  1. 


/ 


),'  J.  (ax)  J ( bx ) dr 
l+xk  1 0 


(M 


» ker'a  berh  - kei’a  bei b + — when  a>p  (5) 

= ber'a  kerb  - bei' a kei2>  when  a<p  (6) 

Proof:  When  a>bt  the  integral  (1+)  transforms  to  I (a>b)  defined  below, 


I (a>b)  = 


-if  _i__H(l 

2 J 1.M*  1 


( az ) J (bz)  d z 
o 


71^/4  3t 

The  poles  in  the  upper  half  plane  are  & , e , and  zero.  Therefore 

l(a>2>)  ~ tt i Res(e7T^'//^)  + irtResCe^14^* ) + ~ • Res(0) 

Note  that  the  pole  at  z = 0 is  enclosed  with  a semicircle,  as  shown  in 


Figure  1.  Thus  we  find: 

-3n£ 


TTt- 


TT2- 


l(a>h)  = T\i 


14  H [^(ae  **)  J (be  + 
1 o 


-9^ 


3it£ 


3tt^ 


+t a ye  ^ ) J (be  H ) + 

4 1 o 

♦ 4 • (-— > 

2 v it  a 


Substituting  the  relations, 


and 


h{^(®  * ) = (ker.x  + tkei^ar) 

X TTt  X 1 

3 vi 
, 4 V 

J [xe  ) ~ ber  x + tbei  x 

0 oo 

H 

(ker.x  ■-  kei-x) 

1 Tit  1 1 


7T£ 

J.(xe  ; = ber  x - ibei  x 
6 oo 


ve  find 

I(a>i)  ■ J + - 
a /2 


(ker, a + kei,a)  ber  b + (ker.a  - kei.a)  bei  2?) 
1 l o x i o 


Using  the  relations 


and 


ker,x  + kei,x  - /2  ker’x 
11  o 


- ker.x  + kei.x  = /2  kei'x 
11  o 


I (a>b)  reduces  to  (5),  where  ve  have  dropped  the  suffix  zero. 
When  a<b,  integral  (4)  transforms  to  l{a<b)  defined  below, 


l(a<i>) 


(az)  H ^\bz)  ds 

O 


3^/4 

The  poles  in  the  upper  half  plane  are  e and  e 

1 (a<b)  = iri  Res(e1r^^*)  + irtResCe^7^^) 

Thus  we  find: 

-3irt  iri  jri_ 

l(a<b)  - ir.t  ^ e J^(dze~*S  (i>e 


-9iri 


3EL 


3ni 


+ itt  ^ e U J2(ae  4 ) B^Ube  h ) 


Therefore 


Substituting  the  relations, 
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1 

3tt£ 


\xe  ) c — ~r  (ker  x + ikei  x) 
o tti  o o 


^ 21  ^erjx  + 


Trt, 


^xe  H)  ~ (ker  * ikoi  x) 

O TT't-  ^ ^ 


XL 

4, 


J^(xe  ) = - (ber^r  - iber^m) 
ve  find 


I (a<b)  = - 

& 


(ber^a  + bei1a)ker2>  + (ber^a  - bei^ft)  kei£ 


Using  the  relation: 


ber^x  + bei^x  = Sz  ber'x 
-ber-^ar  + bei^x  = Sz  bei’a 

I(a<b)  reduces  to  (6),  where  ve  have  dropped  the  suffix  zero. 

Formulas  (5)  and  (6)  are  continuous  at  a=b.  To  prove  this,  note  that 

^(x)  = berx  + i beim 


and 


are  the  solution  of 


W2(x)  = kero  + i keix 


2 d2n?  du  . 2 

x — ~ + X~  - lx  W~Ci 

dx2  ** 


Ti\e  real  part  of  the  Vronski  an 


w. 


"I 


1 

x 
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gives  the  continuity  of  (5)  and  (6)  at  a=b. 


BARNES'  INTEGRAL  REPRESENTATIONS 


Proposition  1. 


Barnes'  representations  of  J (x), 

r.(rsJ-(2.'.?.)V+gedg 

r(v+s+i)  as 


J (x)  = 0 . 
v 2-m 


J: 


(7) 


is  valid  for  v _>  0 and  x >_  0-  When  v~0,  the  order  of  the  integration 
and  the  substituting  x=0  cannot  be  exchanged* 

Wat son's  restriction  (Ref.  2,  p*  192)  that  R(v)>0  and  x>0  may 
be  loosened,  although  v is  restricted  in  this  proposition  to  real 
numbers  in  order  to  state  simply  the  strange  property  at  x=0*  Note 
that  Watson  prohibits  the  use  of  (7)  for  This  restriction  is 

removed  here. 

Proof* 

We  consider 


r(-e 


lim 
Is  |-*» 


(-) 

K2’ 


v+2i 


r (v+£’+i ) 


(a) 


where  0<jc<°°,  in  order  to  show  that  the  contour  integral  derived  from 
(7)  by  drawing  the  semicircle  of  infinitely  large  radius  to  the  right 
of  the  imaginary  axis  amounts  to  zero. 

Changing  -s  in  f ( ~s ) to  +s  by  using  the  formula 


r(-js) 


“JT 

r(s+l)sinirz 


(8) 
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where  s is  a complex  number  that  is  neither  zero  nor  positive,  L 
becomes 


lim 
|s  |-w» 


—IT 

F(s)sinirs. 


v+2s 

(ar/g) 

r( v+s+i) 


Letting 


i 6 

s - re 


(b) 

(9) 


ve  get 

lim  | sinus j = lim  ~ Vox  {e^VS  te~^ 178 ) = lim  jjj-  e7n',lsinel  (c) 

p-X50  p->00 

Substituting  (c)  and  the  asymptotic  expansions  of  T(s)  and  F(v+s+l), 
(b)  becomes 


where 


L . - lim  |-e^l| 


(d) 


^ = (v+2s)log~  + (v+2s+l)  - Tir|sinf)|”  (s-i)logs  - (v+s+~)log(v+s+l) 
which  we  transform  to 

A1  = (v+2s)log  ~ - (v+s4~)log  - iTjr» | sin© I + 1 (e) 

Taking  the  real  part  of  A t we  find 

. v+2rcos0 
QX\ 

~ lim  (pr)  exp[r(26sine-  tr  | sin0 1 ) + l] 

p-H»  ' * 

Because 

26sin9  - it  | sin0  | <,  0 
and 
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v + 2rcos  9 >.  0 

in  the  range 

- 2.  < e < £ 

2 = “ 2 

it  is  proved  that 

tt 

L ^ = finite  when  v = 0 and  0 = — 

=0  in  the  other  cases. 

Therefore  we  can  conclude  that  Barnes'  theory  "by  use  of  (7)  is 
valid  for  v >_  0,  unless,  when  v=0>  the  order  of  the  integration  and 
the  substituting  x=0  is  exchanged. 


Proposition  2. 


Barnes’  representation  of  H^x  (at). 


7re2(v+l)irt  (1)^  j _ 

% t ’ * 


-R(v)+»i 


-R  ( \) ) -wi 


r(-v-8)r(-s)(-|ia)  ds 


is  valid  for  |arg  (-is)  | <_  - and  v >_  0.  The  order  of  the  integration 
and  the  substituting  2=0  cannot  be  exchanged. 

Watson's  restriction  (Ref.  2,  p.  192)  that  |arg  (-s) |<^-is  made 
exact  in  the  above,  although  v is  restricted  in  this  proposition  to 
real  numbers  in  order  to  state  simply  the  strange  property  at  s=0. 

Note  that  Watson  prohibits  the  use  of  (10)  for  real  values  of  s.  This 


restriction  is  removed  here. 


Proof,  we  consider 

L - lim  |r(-v-s)r(-s)(-~)V+2s  s|  (a) 

in  order  to  show  that  the  contour  integral  derived  from  (10)  by  drawing 
the  semicircle  of  infinitely  large  radius  to  the  right  of  the  imaginary 
axis  amounts  to  zero. 
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Changing  the  negative  arguments  of  I’ ( — v - s)  and  r(-s)  to  the 


positive  arguments  by  use  of  (8),  L ^ becomes 


L - lim 
2 |a|-*» 


M 


v+2s 


r(l+v+s)F(s)  sintT  ( v+s  )simrs 


Using  (9)  we  get 

lim  | simr(v+s  )sinirs| 

p->oo 


iim  £ 
-£■+00 


2ttt  sin0 
e 


(b) 


(c  ) 


Substituting  (c)  and  the  asymptotic  expansions  of  F(l+v+s)  and  r(s), 
(b)  becomes 

A? 

= 2tt  lim  | e | (d) 

p->oo 

where 

A ^ = (v+2s)log  — + (v+2a+l)  - 2Trr|sin0|  - (v+s+l)log( v+a+l)  - (s- 
vhich  we  transform  to 

A2  = (v+2s  )log  - (v+s4^)logV+g+-]-'  - 2itr  | sinG  | +1  (e) 


Taking  the  real  part  of  A , we  find 


where 


= 2tt  lim 

r>-H» 


-%z 


v+2rcos0 


2rB+l 


2r 


B = - arg (-is)  sin0  + 0sin0  -Tr|sin0 


Because 


B < 0 


and 


v+2rcos9  0 

in  the  range  -tt<20<tt it : is  proved  that 
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ro|M 


lim | | = finite  vhen  R(v)  = 0 and  0 = — 

7^>oo  c- 

= 0 in  the  other  cases. 

7T  71 

If  V = 0 and  0 = — or  - — , L may  be  finite  but  never  becomes  infinite. 

Therefore,  we  can  conclude  that  Barnes'  theory  by  use  of  (10)  is 
valid  for  v>.0  and  | arg  unless  the  order  of  the  integration  and 

the  substituting  z = 0 is  exchanged-  Q.E.D. 


We  will  show  in  the  following  examples  that  indiscriminate  use  of 
Barnes'  integral  representation  is  dangerous. 

According  to  the  theorem  stated  above,  we  have  the  indent ities, 


J.)  (ax)  J ( bx ) dx  = f!j- 
1+Xk  1 0 2 


r h1(ox)  J (bx)  dx 
4 1 O 


when  a>b , and 


TO  OO 

jj-  J (crc)J  (Zw;)  dx  = j f - 1—  J (ax)  H^1  ^ (bx) 

1+x  J 1+x  ° 


when  a>b . The  ranges  of  integration  in  these  integrals  are  restricted 


on  real  axis;  the  circular  contour  in  Figure  1 is  not  considered. 

Substituting  the  respective  Barnes'  representations  on  the  right 
hand  sides  of  ( 11 ) and  (12),  and  changing  the  order  of  integrations, 


yields  the  results  in  Example  1,  as  will  be  shown  in  Example  2. 

Substituting  the  respective  Barnes'  representations  on  the  left 
hand  sides  of  (ll)  and  (12),  and  changing  the  order  of  integrations, 
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does  not  necessarily  yields  the  results  in  Example  1,  as  will  he  shown 
in  Examples  3 and  H.  The  difference  is  caused  hy  the  pole  of  H^(am) 
at  m=Q. 


The  correct  result  must  he  the  one  found  hy  substituting  the 
respective  Barnes'  representations  into  the  right  hand  sides,  because 
the  integrals  on  the  left  hand  sides  of  (ll)  and  (12),  considered  as 
functions  of  a and  b , are  continuous  at  a-b.  This  condition  is  not 
satisfied  by  substituting  the  respective  Barnes'  representations  into 
the  left  hand  sides,  hut  into  the  right  hand  sides,  as  shown  in  Example  1. 
Example  2. 

We  shall  show  in  the  following  that  use  of  Barnes'  theory  to 
integrate  the  right  hand  sides  of  (11)  and  (12),  which  are  expressed 
here  as 


CO 


I(a>b)  - ~ f JQ(&c)ds 


and. 


I{a<b) 


s 1 f 1. 

V 1+3! 


^ J (aa?)  H ^(bx)  d x 


(13) 

(lM 


yields  the  same  result  as  given  in  Example  1 by  use  of  the  contour 
integral  method. 

(l) 

Using  Barnes'  representation  (10)  and  (7)  for  ' (ax)  and  J Q(bx) , 
respectively,  and  changing  the  order  of  Integration,  integral  (13) 
becomes 
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I(a>b)  * ± 


J r(-i-s)r 


(-s  )(-*£) 


-l-“t 


coi  h 2t 

1 > ''<-*)$•  ! 
ds  2^  / rTtTiT"  Ki  dt+I 


where 


. u» 

e+o  ./  1+a 


l+2s+2t 


l+2s+2t 


dm  + lim 


e->o  ./  1+x 
*4*00 


The  last  term  on  the  right  hand  side  of  (a)  results  from  the  existence 
of  the  pole  at  x = 0 in  the  integrande  of  (13).  The  residue  at  this 
pole  is  evaluated  by  drawing  a semi-circle  as  in  Figure  1. 

Changing  s to  s-1  in  the  first  integral  in  (a),  I(a<b ) becomes 


-l+2s 


r(-s)r(-s+i)(-i) 


<» i 

C r i-th) 


r ( t+i ) 


K d t+J 
1 a 


-l+2s+2t 


-l+2s+2t 


dx+  lim 


e->o  J 1+x 

— CO 


Changing  x to  -x  in  the  second  integral  of  (d),  K ^ becomes 


in(-l+2s+2t) ) f x 


-l+2s+2i 


4 / \ 

Letting  i=x  , (e)  integrates  to 
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*L  = t [l~e 


-2iir  (s+t  \ 


j'frH1  - ¥) 


vhich  transforms  to 


2£ir  (s+t)\ 


Kx  = f (1-e 

\ 

. tri(s+t) 

= - ^7Te 


sin^s+t) 


(f) 


cos  — (s+t) 


Substituting  (f),  (c ) decomposes  into  products  of  single  integrals. 


Ha>b)- |(v8+vJ*S-  (s) 


where 


r .1  f h) 

1 " 2iri  j i 

-on. 


-1+2  s 


rTs+TT 


e r(-s+l) 


sin 


its 


ds 


■’  fb\2tivt 


ZSm 


7T  007, 

o r tst  e 


_2 

27t£ 


ds 


(r(t+i)) 


sin 


ut 


T = 


7T_ 

2 


. / . \ -l+2s . 

°°v(ta 


3 2tii 


/a 

-007, 


r(-s+i) 


r(s+i) 


cos- 


TTS 


ds 


J 


II  ruj/_s!ll 

(r(4+i)  )2  cosl| 


dt 


-“t. 


The  integrand  of  J has  a single  pole  at  s 
a double  pole  at  s = 2n,  where  n = 1,  2,  ... 

, -l+2s 


f(s)  = 


MI 


0 and  s = 2n+l  and 
Let 

(h) 


r(6+i)  r(s) 
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Then  he comes 


J1  = + ^ (_l)n  f' (2n)  ’2  (-l)n/(2n+l)  (i) 

n= 1 n=o 


The  first,  second,  and  third  terms  on  the  right  hand  side  of  (i)  are 
the  residues  at  8 - 0,  s = 2n,  and  s - 2n+l , respectively.  Substi- 
tuting (h),  (i)  becomes 


I1  = -2iker'a 


where 

ker'x 


= _l,lV  (-1)  ,£x 

a:  U ^ (2w)!(2n+l)!  2; 


lm-1 


n=o 


im+3 


00  X 

E(-l)”(2) 

(2n+l )J (2n+2) ! 


log  | - |^(2n+2)+<M2n+3)) 


n~o 


The  integral  ^as  a single  pole  at  £ = 2n,  where  n = 0,1,2,  .. 

Counting  the  sume  of  the  residues,  becomes 

I2  = ber  b 

The  integral  I has  a single  pole  at  s = 2n  and  a double  pole  at 

s = 2n+l,  where  n = 0,1,2,...  . Counting  the  sum  of  the  residues. 

1^  becomes 


= -2 i kei'a 

where 
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00 


kei'x  = 


, n , _ 
n,Xs4  +3 

(-1)  2 

( 2n+l ) ! ( 2n+2 ) ! 


n=o 


( 


-D»<i> 


'kn+l 


(2n) ! (2n+l) S 


n-o 


...  x 1 
loe2  - 2 


^(2n+l)+*(2n*2)j! 


The  integral  has  a single  pole  at  t = 2n+l,  where  n = 0,1,2,...  . 
Counting  the  sum  of  the  residues,  1^  becomes 

1^  = - bei  b 

Substituting  the  above  values  of  1^,  I ^ and  I ^ into  (g),  we  can 

show  that  I{a>b)  in  (13)  is  given  by  (5). 

Using  Barnes'  representations  (7)  and  (10)  for  J^(ax)  and  (&x), 

respectively,  and  changing  the  order  of  integration,  integral  (lH) 
becomes 


l{a<b ) 


(r(-t))2 


v 2 ‘ 


„ 1+2S 
(2.) 

f(-,q  T 2;  K2dS 

r(s+2) 


U) 


where 


*2  = 


oo 

■/ 


l+2s+2t 


x 


dx 


1+x 


Integration  of  Kg  yields 


Kg  = - iir  e^ir(s+^  sin  ^(s+t)  (k) 

Substituting  (k),  (j)  decomposes  into  products  of  single  integrals, 

l(a<b)  = - | I2  + J3  -Z\)  (1) 
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where 


2L 

2 


J1  2v  i 


cxy t 

L 


1+2  s ^^T  s 
a \ e 


7T  s 


d s 


r(s+2)r(s+2)  cos  2 


and 


2 
rr  \ 


_\IL  /* 

“ 2id  I 


(-i-b\  ^ ivt 


ri) 


^F(t+l)jf  sin-rrt  sin-^ 


d t 


^3  27ri 


-tt_ 

2 


if 


a\l+2s  ins 
& 


M. 


-~S  (r(s+l)r(s+2)  sin- 


ds 


/irX1"  °°i  /_£M 

3 (iL  r ( jz 


it 


2 m. 


2i 


£ 


ini 


|r(t+l|?  si 


sinntcos- 


ni 


di 


They  integrate  to 


1^  = her 'a 
1 2 =-2  keri> 
I^  - bei'  a 

and 


1^  = 2 keib 


Substituting  1^,  I^,  I^j  and  1^  in  the  above  we  can  show  that 
l(a<b ) in  (l4)  is  given  by  (6). 

Example  3. 

We  shall  show  in  the  following  that  use  of  Barnes'  theory  to 
integrate  the  left  hand  sides  of  (ll)  and  (12),  which  are  expressed 
here  as 
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1 


(15) 


yields 


1+x 


J, (ax)  J ( bx ) dx 
i o 


J = ber  (~  + ker'  a)  - bei b kei'  a 

when  a>b 

= ber'a  ker b - bei' a kei£> 

when  a<b 


(16) 


(17) 


The  result  (l6)  for  a>b  does  not  agree  with  (5)*  The  result  (17)  for* 
a<b  agrees  with  (6).  The  difference  is  caused  by  the  pole  of  H^(am) 
at  m=0,  which  is  not  counted  in  (15)*  The  meaning  of  the  value 
of  (l6)  is  not  yet  known. 

We  shall  use  a single  Barnes'  representation  for  the  product 
J^(ax)  J Q(bx)  in  the  example.  Two  expressions  are  available  (Watson 


2,  p.  148). 


Jl(“>  Jo(te)  , 2n+l 

°°  / xrifbxy 

- iZ 

n=o  (n! ) 


F ( -n , -n ; 2 ; 


(-l)n  (2§)2n+1 

n!  (n+1 ) ! F^'n’  "n_1;  1;  T* 

n=o  a 


-z 


(a) 

(b) 


where  F(  , ; ; ) denotes  a hypergeometric  function.  Both  hyper geometric 
functions  are  polynomials  of  nth  order. 

Use  of  (a)  yields  the  integral  for  a<b , as  shown  in  the  following. 

The  integral  (15)  for  this  case  will  be  denoted  by  J(a<b)  in  anticipation. 
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Transform  (a)  to  a Barnes1  representation: 


J1te)Jo(te>  = F 2w£ 


°>i  bx_2s+l  x 

- f r-l~s)  '2 f/-s,  -s;  2;  2^Ad£ 

jU  r<s+1>  V » / 


Substituting  this  into  (15)  and  changing  the  order  of  integration,  we 


J{a<b)  = r 


00  lb\  2s +1 

a 1 rr(-e)V2/ 

£>  2iri  / f(s+l) 


f °° 

ds/": 

i J. 


We  must  examine  the  asymptotic  behavior  of 
2 


F(-e,  -e;  2;%) 
2? 


",  — s — 1 1+s  2 s 

r(-s)r(2+s)  [ 5 ^1“^2  ^ d5 


For  — 41  , we  have 
2> 


2 ' 

1-—  5 

2>2 


because  we  assume  R(s)  >_  0.  Therefore  we  have 


T ’-Si2V/41 

for  any  values  of  a when  a<£>.  To  prove  (e),  note  that 


f rS_1(l-01+S  dE.  = r(-e)r(2+e) 

•'n 


Letting  a:  = E,  , the  second  integral  on  the  right  hand  side  of  (d) 


is  integrated  to: 
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Letting  x = £ , the  second  integral  on  the  right  hand  side  of  (d) 
is  integrated  to: 


oo 

/ 


2$+l 


x 


dm  = 


IT 

IT 


l+m 


cos- 


TTS 


Thus  (d)  becomes 


2 
IT 


J{a<b}  ‘ ST  ffii3) 


d s 


"-00^ 


7KS*  . 

cos-^simrs 


(D 


where 


(|)2S+1  / 2N 

^1(S)  = f "I F 

1 ® r2(s+i)  \ £2 


(s) 


For  a<p  we  may  apply  the  residue  theorem  to  integrate  (f).  The 
integrand  of  (f)  has  a single  pole  at  s=2n  and  a double  pole  at 
s = 2n+l,  where  n = 0,1,2,...  . Thus  (f)  integrates  to 


J{a<b ) = 


(-!>”  fpM  + (-l)n/{(2n+l) 


n=o  - 


n=o 


th) 


Substituting  (g),  (h)  becomes 

J(a<b)  = |-  ^ + log~  - M1  + j N± 

hn+i 


where 


n= o 


( i)n(l}  ^ 

- 1 - P F ( -2n , -2n ; 2;  — ) 
( (2n) ! )2  t2 


4n+3 


a 


(-Dn(l) 


L = rV  V~X;"  — „ F(-2n-l>2n-l;2;  ~) 

1 * Zy  ((2n+l)!)2 


n=o 


(i) 
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n(-)  2 

M - g-  V F ( -2n-l , -2n-l  j 2 ; ~) « ( 2n+2 ) 

1 *“-(2»+Uir  b2 

n-o 


"rlE— Ik*-.,-.,  * 4] 

1 ® ^ ((2n+l)l  )2  9 b2 

n=o  v l J s=2n+l 


Expressing  F(-2n,  2n;  2;  — ) as  a polynomial,  K "becomes 

bd  1 


°°  4n+l  2 n 

^ ■7!  (“i)n(|)  V V~ 

1 ^ 2 " ((2n~k) ! jkl (k+l) ! 


Dividing  the  second  series  into  two  series  with  regard  to  even  integers 
and  odd  integers,  K ^ becomes 


4n+l  n 


i=E<-»"‘f>  E 


i (2.) 

(2n-2p)!)2(2p)l(2p+l)l  b 


°°  Un+1  n 

L(.i)n(^  V' i 

\ ±1  \2,  ^ ( 2n~2p+l ) ! ( 2p-l ) ! ( 2p ) ! 

n=l  p=l 


Letting  n = p+p,  X becomes 


:DP  t 

[2p+nr  v 


((2q)l) 


i ^ • 


II (2p-l)l  v2; 


=ill_  <1>+2 

.,x.v2  '2 


( (2p+l) ! ) 
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Thus  we  get 


K = bei'akerZ?  + her 'a  bei  b 


(J) 


CL 

Expressing  F(-2n-l,  -2n-l\  2;  — ) as  polynomial,  L becomes 

ZT  X 


bn+3 


2n+l 


2k+l 


h - £ (-lf&  L 


n=o 


fe=o 


((2  +l-k)l)  fc!(fe+l)l 


(?-) 

V 


Dividing  the  second  series  into  two  series  with  regard  to  even  integers 
and  odd  integers. 


kn+ 3 


- £ (-i,B(l>  £ 


<f> 


n=o 


p=o 


( ( 2n+l-2p ) ! ; ( 2p ) ! ( 2p+l ) ! 


Sp+1 


i+n+3  " 


+ £ (-i>n(|)  £ 


n=o 


p=o 


..  ^ (2-) 

((2n-2p)t )2(2p+l)t(2p+2)l  b 


kp+ 3 


Letting  n = p+q  , becomes 


£ V1  v (-i)^  b ^ 
5 £ «*♦!>.>*  (J> 


+ 


*£ 

n — 


(-DP 

( 2p+l ) I (2p+2) ! 


(-) 

V 


%>+3 


£^Di(|)^ 


Thus  we  get 
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L = bei'a  beib  - ber'a  ber£> 


Next  we  compute 


V 


3 a 2 1 

^F(-S,  -s;  2; 

b 


s=2n+l 


which  is 


°1  = 


is  Z-/  k) 


((-s)k  ) 2k\ 

. .K  (<±.\ 

(k+l)l  V 


k= o 

By  differentiation, 


J s-2n+l 


(fe+l)]  V Z-f 

/c=o  h=o 


*-*E« 


2n+l-h 


Thus  we  find: 


-Mi  + f *i 


hn+3 


n=o 


where 


2n+l 


■ E (-il"(l)  E 


r-)  p 

V n,k 


2k+l 


k=o 


{{2n+l-k)l  yk\  {k+ 1): 


k- 1 


pnx  - +(2"+a>  * Esthr 


h=o 


which  transforms  to 


Pn ik=  -i|)(2n+2-fc) 


Thus  we  have 


U) 
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kn+3 


2n+ 1 


- Af  ’ + 


h = - e e 


n=o 


k=o 


ifr(2n+2-fc) 

( ( 2n+l~k ) ! )2k\{k+l)\ 


(-) 

V 


2k+l 


Dividing  the  second  series  into  two  series  with  regard  to  even  integers 
and  odd  integers, 


hn+3 


n 


-M 


1 2 


*1  = -E(-i)”(f)  E 


4>(2n+-2-2p) 


n=o 


P= o 


((2n+l-2p)! ) (2p)! (2p+l)! 


V 


4p+l 


-E(-i)n<F'J  E 


4n+3 


n 


ij;(2n+l-2p) 


n=o 


p=o 


((2n-2p)r(2p+l)!(2p+2)! 


(£L) 

V 


4p+3 


Letting  n = p+q,  ^ "becomes 


(-l)q^(2c7+2)  ( b }^+2 
^((2^1)!)2  2 


(-l)^(2q+l)  ,b_,kq 
«2«),a  2 


= - bei’a 


^Lj  ^ei^\(7+2 
q=o 


\l>(2q+2 ) + 


+ her 


(her&)^(2q+l) 


(1) 


where  (beii)^^^  and  (berfc)^  denote  the  (4<q+2)th  and  (tqHh  .order 
terms  of  the  series  of  bei£>  and  beri>  respectively. 
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Substituting  (j),  (k),  and  (l)  into  (i)  we  find  that  J{a<b)  is 
equal  to  (17)* 

Use  of  expression  (b)  of  J^as)  JQ(bx)  to  integrate  (15 ) yields  the 
integral  for  a>b,  which  will  be  denoted  by  J{a>b)  in  anticipation. 
Transform  (b)  to  a Barnes'  representation: 

2s+l 

(m) 


1 7 b2 

Ji'“)Jo(te)  = / ' rTs+gy—  f(-s  ,-E-i  ds 

*/  • a 

-00^ 


Substituting  (m)  into  (15),  and  changing  the  order  of  integration,  we 
have 

2s+l 

°°7-  _/  w<Z> 

. r r(-s)(- 

J (a>b) 


i 7 v x *- 

• 2^r  / F(-s’-s-1;li  -2)  H — (n) 


2s+l 


a 7 1+x 

•'o 


It  can  be  proved  that 

h2  I 

F(-s,  -e-l;l;~)  I <1 

a 

for  a>£.  Following  the  similar  procedure  to  the  previous  case,  we  get 


™4j>>  -it8 

■ -o*i 


cos-g  sinus 


where 


(-) 

V 


2s+l 


“ r(s+i)r(s+2)  F^_s’"6'1’1;a2^ 

It  integrates  to 
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J{a>b)  =lK2  + L2  logf-M2  + ! N2 


(o) 


where 


- 


■E 


n=o 


(-l)"(f)Un+1  .2 

F(-2n,  -2n-l;  1;  ~) 

a 


( 2n ) ! ( 2rc+l ) ! 


>*E 


(-Dn(|) 


im+3 


'2  / ^ (2n+l ) ! (2n+2) ! 
n=o 


7 F(-2»-l,  -2«-2.;  1;  ~) 


a 


kn+ 3 


_^(_l)n(|-)  2 

M2  = F(-aK-1>  -^'Ss  1; 
n= o 


and 


*2=  E 

n=o 


(-i)n(|) 


im+3 


( 2n+l ) ! ( 2n+2 ) ! 


3 7-7  / T T ^ \ 

3s  F^S’  "S“1;  1» 


s=2n+l 


Following  the  similar  procedure  to  the  previous  case  we  find 
= bei’ a ber£>  + her’ a berZ? 

= bei'  a bei b - her'  a herb 


9 t b \ 

Js  F^3’  -S-U  li  "g) 

a 


s=2n+l 


2n+± 


k- o 


h 2 k 

(2n+l)!(2n+2)!  (-) 
fr+l-k)H2n«>-k)l  (W)2  ^ 


k-1 

E 
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~M  2 + t N2 


= herb 


(ip(2q+2)  + \li(2q+3 ) ) - 


- beifc 


(bei-'  a) 

q=o 


hq+1 


(^(2q+l)  + f(2q+2)) 


Substituting  these  formulas  into  (o),  we  find  that  J(a>b)  is  given 
by  (16) 


Example  U 

We  shall  use  in  the  following  two  Barnes'  representations  to 
integrate  (15).  Using  (?)  for  both  J^[ax)  and  JQ{bx)  and  changing 
the  order  of  integration,  (15)  becomes 


J = 


(2t ri  )‘ 


^r(-s)(|) 
r(s+ 2) 


l+2s 


-0*1 


ds 


Sf-tM  |) 

r.;*+i) 


/- 


1+2 1 

00 

— «J 


l+2s+2t 


x 


dx 


(a) 


1+x 


Two  cases  occur  in  accordance  with  the  order  of  integration.  We 
shall  show  that,  when  integration  with  regard  to  t,  or  s,  is  performed 
first,  J yields  (l6),  or  (17),  respectively.  These  integrations  will 
he  denoted  hy  J{a>b)  and  J(a<b),  respectively,  as  in  the  previous 
example. 

To  prove  this,  let 

s + t = u (b) 


in  (a).  Assuming  that  integration  with  regard  to  t is  performed  first, 
we  transform  (a)  to 
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00 


J = 


3 7 r(-u)(f) 


1+2  u 


j 


2H  I r'(w+2f 


F du  f 

J 


X 


1+2  u 


1+2 


dm 


(c) 


where 


^ Of 

F s -L-  f r(t-h)  r(u+2)  r(-t) 

* 2vi  I t(-u)  r(u-t+2  Tti+lT  V dt 


(d) 


We  shall  show  that  F in  (d)  transforms  to 

h2 

F = P(-ut  -u- 1;  1}  —•) 

a 

To  prove  (e),  apply  (8)  to  r(w+2)  and  f(u  - i+2)  in  (d),  and  we 
have 


(e) 


&>% 


F = 


aJi  r(-w)r((-w-D  f ^ 

*L<&i 


at 


sinirt 


(f) 


where 


^ ' ' r2(t+l)  « sin"(u+2) 


On  integration  (f)  becomes  (e).  Then  c/  in  (c)  is  equal  to  J(a>b).  The 
other  case  can  be  proved  similarly. 

In  the  following  we  shall  carry  out  the  integration  of-J(«>t). 

In  this  ease  we  first  perform  the  integration 


2 

if 


M m 


T 


Siili  r87*+i)ooA^  “in”* 


(“) 

V 


2t 


dt 


(g) 


which  is  found  by  substituting 
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00 


X 


l+2s+2t 


Ax  = 


1+x 


IT 


IT 


TT 

COS— 


(8  + t) 

2 


into  (a).  Because  s is  imaginary,  the  integrand  of  (g)  has  a single 
pole  at  t = n,  where  n = 0,1,2,  ...  . Thus  (g)  becomes 

2n 


M = 


■n= o 


<«t  >2 


,mr  us , 
cos(—  ) 


Dividing  the  above  series  into  the  series  of  even  integers  and  odd 
integers,  M becomes 


M = 


+ beiZ> 


Substituting  this  result,  J{a>b)  becomes 


2 

J(a>b)  = ^ (^beri  + N^eib) 
where 


2iri 


f. 


l+2s 


(-) 

V 

- r(s+i)r(s+2) 


ds 


simrs  cos- 


TTS 


and 


*2  = 


J 


(-) 
k2  1 


l+2s 


ds 


r(s+i)r(s+2) 


- °n- 


sinus  sin; 


TTS 


The  integrand  of  N has  a single  pole  at  s = 2k  and  a double  pole 
at  s = 2n+l,  where  n = 0,1,2,  ...  . Counting  the  sum  of  the  residues, 
becomes 
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AT  = \ (ker  'a  + k 
± a 

The  integrand  of  has  a double  pole  at  s = 2n  and  a single 
pole  at  3 = 2n+l,  where  n = 0,1,2,  . ..  . Counting  the  sum  of  the 
residues,  becomes 

= - “p  kei*  a 

Thus  we  can  conclude  that  J(a>b ) is  given  by  (l6). 

Similarly  we  can  find  that  J{a<b)  is  given  by  ( 17 ) - 
Example  5- 

We  shall  show  in  this  example  complicated  variants  of  (l)  whose 
integration  procedure  is  not  obvious. 

The  most  complicated  integrals  we  encountered  in  the  study  of 
viscoelastic  deformation  of  the  floating  ice  plate  (Ref.  l)  are: 


00 

f 

r’  = / 


T-a 


~f  /d{x) 

o 


-a  (x)t  i 

(l-e  ) J A ax)  J (fcar)ar  dx 


(18) 


and 


oo 

/x-a, 
/Z>(x) 


-a.  (x)t  v 

(l-e  ) J (ax)  J (bx)x  dx 


(19) 


Here  viscoelastic  constants  2?  , and  are  used  to  define 

nP  ffp 

T = — (l+jj£) 


£ = 


h*E2 
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D(x ) = [t(iW*)-1]2  + 


a^x)  = 


tx  + 1 + t - /P(x) 

2(E+xk) 


+ I + T + /P(x) 

2{E+cck) 


and.  t is  time.  Integral  has  essential  singularities  at  the 

k 

of  x = -E , because 


lim  cl  (x)  = 00 
But  does  not,  because 


lim  a.  (x)  - 7—7 


x^->-E 


1+t (1-E) 


Both  integrals  simplify,  as  shown  in  Reference  (l),  to 


V 


h-if 


l-e 


-x 


x J 1(af(x))  J Q(bf(x))  f(x)  dx 


and 


J2=¥ 


V 

/ 

T t 


1-e 


-X 


x J1iaf(x))J0(bf{x)  fix)  dx 


where 


5hl 


roots 


(20) 


(2l)  e 


_JL 

2 E 


1 + T 


_1 

2 E 


1 + T 


+ 


and. 

fix)  =(e  • 


\pt-x  X t-ar 

T t-X  X 


1 

4 


Parameters  X^,  and  t are  in  the  range 
0 < ^ < t < X^ 

Integrals  1^  and  as  shown  by  (20)  and  (21),  respectively, 

are  continuous  at  a=b  a condition  that  must  be  satisfied  by  the 

results  of  the  integrations.  However,  it  is  not  clear  whether  the 
direct  substitution  of  the  Barnes1  representations  of  (7)  for  J (a fix)) 
and  J ibfix))  does  or  does  not  yield  the  results  satisfying  this  con- 
dition. Change  of  J^(a/(*c)),  or  J^ibfix)) , to  respective  Hankel 
functions  may  be  achieved,  not  by  simply  extending  X to  -X  but  by 
choosing  a complicated  contour;  I am  not  sure,  however,  if  I should 
dare  to  do  this.  I am  in  trouble! 
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SINGULAR  PERTURBATIONS  IN  HEAT 
CONDUCTION  AND  DIFFUSION  PROBLEMS 
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Fluid  Mechanics  Branch 
Applied  Mathematics  and  Sciences  Laboratory 
U.S.  Army  Ballistic  Research  Laboratories 
Aberdeen  Proving  Ground,  Maryland  21005 


ABSTRACT.  In  one-dimensional  problems  of  diffusion  or 
heat  conduction  where  discontinuities  or  steep  gradients  occur 
in  the  initial  or  boundary  conditions  a singular  perturbation 
analysis  can  give  accurate  estimates  of  the  solution  when  numeri- 
cal methods  prove  inefficient  or  inadequate.  In  fact  the  discon- 
tinuities can  be  exploited  in  the  singular  perturbation  analysis 
to  obtain  an  asymptotic  series  representation  of  the  solution. 

Several  different  problems  of  increasing  complexity  can 
be  explicitly  solved  when  the  boundary  and  initial  data  are 
given  in  piecewise  polynomial  form:  a.)  infinite  region  or  pure 

initial  value  problem  b.)  semi- infinite  region  and  c.)  finite 
region . 


The  approximate  methods  also  apply  to  the  case  when  no  dis- 
continuities occur  in  the  prescribed  data  or  its  derivatives. 

1.  INTRODUCTION.  It  is  frequently  stated  in  regard  to 
heat  conduction  and  diffusion  problems  that  "discontinuities  are 
immediately  damped  out".  In  some  problems  arising  in  engineering 
however  this  view  is  too  over-simplified  to  be  realistic  because 
the  damping  out  process  itself  is  the  heart  of  the  problem.  Typically, 
such  problems  exhibit  a behavior  usually  referred  to  as  "very  steep 
gradients"  which  present  severe  difficulties  to  attempted  solution 
by  numerical  techniques;  viz.,  very  small  mesh  sizes  and  excessive 
roundoff  errors.  We  shall  examine  in  this  paper  how  such  problems 
are  most  suitably  handled  by  a singular  perturbation  analysis. 

The  analysis  will  show  how  very  accurate  estimates  of  the  solution 
can  be  obtained  with  just  a few  easily  calculated  terms. 

We  first  wish  to  give  an  example  of  the  type  of  problem 
we  have  in  mind,  namely,  heat  conduction  in  rifle  barrels.  We 
assume  circular  symmetry  and  independence  of  the  axial  coordinate. 

The  phenomenon  is  then  described  by  the  heat  conduction  equation  in 
radial  coordinates : 

ut  = a[urr  + ? ur] 

The  radial  coordinate  varies  from  rQ  = interior  radius  of 
the  barrel  to  r^  = exterior  radius.  The  initial  and  boundary  (radia- 
tion) conditions  are 
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u(r,0)  = ambient  temperature 

u(rQ,t)  + h0ur(r0,t)  = propellant  gas  temperatures  at  time  t 
u(r^,t)  + h^ur(r^,t)  = ambient  temperature 

where  hQ  and  h-,  are  heat  transfer  coefficients.  The  coefficient  of 
thermal  diffusivity,  a,  for  mild  steel  is  0.12  cm^/sec.  A typical 
length  scale  in  this  problem  is  r^-r0  which  is  on  the  order  of  1 cm. 

A typical  time  scale  is  the  duration  of  the  phenomenon  which  is  on 
the  order  of  1 millisecond.  In  non-dimensional  form  then  the  coef- 
ficient a is  on  the  order  of  10"4.  Although  one  usually  thinks  of 
metal  as  being  a good  conductor  this  problem  is  one  in  which  the 
conductivity  may  be  considered  as  very  poor  due  to  the  short  dura- 
tion of  the  heat  pulse. 

An  analysis  of  the  behavior  of  solutions  of  equations  of 
the  general  form 

ut  = e[a(x)uxx  + b(x)ux  + c (x)  u] 

has  been  undertaken  by  the  author  and  explicit  formulas  for  such  pro- 
blems have  been  obtained  through  singular  perturbation  techniques. 
However,  in  the  present  paper  we  shall  deal  only  with  the  diffusion 
equation 

ut  = £Uxx 

due  to  space  limitations.  The  behavior  for  this  case  is  simpler  than 
the  general  case  but  typical.  At  first  only  the  Cauchy  problem  (infi- 
nite rod)  will  be  analyzed.  Later  it  will  be  shown  how  extensions  to 
mixed  initial-boundary  problems  are  easily  accomplished. 

There  has  been  some  treatment  of  this  type  of  problem  in  the 
literature  - see,  for  example  references  [1] - [4] . All  of  these  only 
develop  the  first  order  approximations  however  and  require  more  strin- 
gent conditions  on  the  data  than  we  have  found  necessary.  In  particu- 
lar the  case  where  discontinuities  arise  between  the  boundary  data  and 
the  initial  data  has  not  been  analyzed.  The  present  treatment  is  very 
direct  in  its  approach  and  all  of  the  approximations  will  be  explicitly 
obtained.  The  precise  effect  of  discontinuities  in  the  data  and  its 
derivatives  will  be  clear  from  the  asymptotic  representations  obtained. 

The  results  presented  here  are  only  partial  and  give  an  indica- 
tion of  the  general  approach.  A full  development  for  the  more  general 
case  will  be  available  in  the  author's  thesis  [5]  which  is  currently 
being  completed  at  the  University  of  Delaware. 


2. 

Cauchy  Problem  for  the  Diffusion  Equation.  Consider  the 

following 

problem  in  the  region  < x < 

»,  0 _<  t < T : 

CD 

ut  = euxx 

for  t > 0 

(2) 

limitt^0  u (x , t ) = 4>(x) 

wherever  <A(x)  is  continuous 

(3) 

7 

| u (x , t ) j M exp  [ax  ] 

for  0 < t < T 

and  - 


00  < X < 00 


where  M and  a are  positive  constants. 

The  existence  and  uniqueness  of  the  solutions  of  this  type 
of  problem  are  thoroughly  discussed  by  A.  Friedman  in  [6] . In  the 
present  discussion  we  shall  be  more  concerned  with  the  computational 
aspects  of  the  problem.  However,  in  passing,  we  should  make  a 
few  remarks  about  those  more  fundamental  questions:  a.)  The 

existence  of  a solution  is  guaranteed  provided  4eaT  <1.  b.)  Con- 

dition (3)  is  sufficient  to  guarantee  uniqueness.  It  is  much  weaker 
than  the  usual  boundedness  condition  specified  for  such  problems 
from  physical  reasoning,  c.)  Condition  (2)  should  be  replaced 
by  a more  general  requirement  if  <Kx)  is  only  locally  integrable. 
However  we  will  consider  only  </>(x)  which  are  continuous  except  at 
certain  isolated  points  where  it  has  well  defined  jumps.  In  such 
cases  condition  (2)  is  sufficient,  d.)  Condition  (3)  applies 
in  particular  to  4>(x)  = u(x,0). 

The  solution  of  problem  (1),  (2),  (3)  can  be  written  in 
the  well  known  integral  representation  form 

co 

u(x,t)  = / F(x-y,et)*(y)dy 


where  F(x,t)  = exp  [-x2/4t]. 

This  integral  is  not  always  easy  to  evaluate  explicitly 
or  even  numerically  so  we  proceed  with  a singular  perturbation 
analysis  to  obtain  easily  calculated  solutions.  We  wish  to 
emphasize  here  that  the  singular  perturbation  technique  can  be 
extended  to  more  general  equations  where  the  fundamental  solution 
is  not  known  a priori. 

3.  The  Functions  Hn,  Hn*  and  vn-  In  this  section  we 

introduce  certain  functions  which  will  be  convenient  later  in 
the  discussion. 

Define  the  initial  value  functions  h (x)  and  h*  for 
n = 0,1, . . . n n 

i xn/n!forx  > 0 
= | 0 for  x < 0 

and 

hJ(x)=  hn(-x) 

The  functions  Hn(x,t)  and  H*(x,t)  are  then  defined  as  the  (unique) 
solutions  of  ut  = uxx  which  satisfy  the  growth  condition  (3)  and 
the  respective  initial  conditions 

Hn(x,0)  = hn(x) 

Hj(x,0)  = h*  (x) 
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Let  the  two  auxiliary  functions  E(x,t)  and  F(x,t)  be 
E(x,t)  = 1/2  erfc  (-x// 4irt) 

F(x,t)  = exP  [-x2/4t] 

where 

erfc  (y)  =Jk 
v ir 

y 

It  is  easy  to  show  by  induction  that  ^ satisfies  the 
recursive  formula 

H0  = E 

Hx  = xE  + 2tF 

Hn  = E (xIin-l  + 2tHn-2^>  for  n > 2 
Another  useful  formula  for  Hn  is 

Hn  ’ 5T  Ivn  E * 2unFi 

where  un  and  vn  are  polynomials  in  x and  t defined  by  the  recursive 
formulas 

u0  = o v0  = 1 

u'j_  - t v-^  = x 

vn  = x vn_i  + 2 (n-1) t vn_2 

un  = x un-l  + 2 (n-1) t un-2 

Similarly,  define  E*  and  F*  by 

E*(*,t)  - E(-x,t) 

F*(x,t)  = -F  (x, t) 

then  we  can  show 

H*  * E* 

H{  = xE*  + 2tF* 

m 5 lxHS-l  * 2tH^-2) 
and  n 

K =irr[vnE*  * 2“nF*l 

The  proof  of  all  of  these  is  routine  so  we  omit  it.  The 
importance  of  the  recursive  relations  is  that  they  reduce  all 
calculations  to  the  evaluation  of  the  well-known  functions 
exp[-x^]  and  erfc(x). 


546 


We  note  here  that  the  polynomials  vn(x,t)  coincide  with 
the  heat  polynomials  discussed  by  Rosenbloom  and  Widder  in  [7] . 
Because  vn(x,t)  is  a solution  of  the  diffusion  equation  ut  = uxx 
and  has  initial  values  vn(x,0)  * xn  then 

(5)  vn(x,t)  = n!  [Hn(x,t)  +■  H*(x,t)] 

Let  L<)»  = <j>xx.  Then  another  convenient  representation  is 

n*  Lk(xn)  tk 

(6)  v (x,t)  = E p 

n k=0  K' 

where  n'  is  the  smallest  integer  not  less  than  n/2.  Note  that  n' 
can  be  replaced  by  any  larger  integer  since  Lk(xn)  = 0 for  any 
k > n/2. 


4.  Outer  Solution.  The  solution  of  (1),  (2)  and  (3) 
is  assumed  to  have  the  asymptotic  representation 

u(x,t)  = 50(e)  u0  + 61(e)u1  + 

where 

= 1 

and 

W^°C5k^)  as  e * ^or  k = 

By  standard  procedures  one  easily  can  show  that  the  only  reason- 
able choice  for  the  asymptotic  sequence  is  6k(e)  = ek.  The 
equations  for  the  uk  are  then  obtained  by  substituting  into  (1) 

(u0)t  - 0 


<ukh  = (uk-lhx- 
with  initial  conditions 


(k  > 1) 


uo(x,0)  = 
uk(x,0)  = 
These  are  easily  solved 
uk(x,t)  = 


♦ 00 
0 

when  4>  (x)  is 
tkLk  <b  fx) 

— kr 


Ck  > 1) 

sufficiently  differentiable, 
(k  > 0) 


The  n-term  outer  expansion  of  solutions  to  (1) , (2) , (3) 
is  therefore  given  by 

n xsaJ/LtM  + o((et)n+1) 


(7) 


n 

u(x,t)  = E 
k=0 
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k 

where  L is  the  operator  L applied  k times. 

This  representation  is  very  advantageous  for  computations 
since  it  reduces  the  problem  of  finding  u at  a point  (xQ,t)  to 
the  evaluation  of  <f>  and  its  derivatives  at  x = x0.  The  error 
is  of  order  (/"eT)2n+1  provided  <f>  has  2n+l  continuous  derivatives 
in  a neighborhood  (xQ  - h,  x^  + h)  where  h > > /"It.  This  claim 
is  not  difficult  to  prove  using  the  integral  representation  for 
u(x,t)  and  growth  properties  of  F(x,t).  We  shall  not  include  a 
proof  here. 

In  passing  we  note  that  for  more  general  operators  such 
as  Lu  = a(x)  uxx  + b(x)  ux  + c(x)u  the  outer  solution  of  u^.  = eLu 
is  given  by  the  exact  same  formula,  (7).  In  fact,  this  representa- 
tion is  a formal  solution  if  n = 00  and  is  a true  solution  whenever 
the  series  is  defined  and  convergent  for  all  x. 

5.  Interior  Layer.  In  the  usual  singular  perturbation 
problem  the  outer  solution  fails  to  satisfy  some  of  the  data 
specified  in  the  original  problem  and  an  inner  solution  is  derived 
to  correct  any  discrepancies.  In  the  present  case  however  the 
outer  solution  satisfies  the  prescribed  data,  given  by  (2),  so 
an  inner  solution  is  not  required  for  the  usual  reason.  This  is 
a rather  unique  feature  of  the  pure  initial  value  problem  under 
consideration  and  should  not  be  expected  in  general.  In  contrast, 
if  we  were  concerned  with  a mixed  BVP-IVP  problem  then  the  usual 
difficulties  would  arise. 

There  is,  however,  another  source  of  singularities  in  the 
behavior  of  a perturbation  solution,  namely  the  presence  of  dis- 
continuities in  the  prescribed  data  and  its  derivatives.  It  is 
apparent  that  this  type  of  difficulty  occurs  with  the  representation 
(7)  since  any  discontinuity  in  <j>  (x)  or  its  derivatives  are  propagated 
into  the  solution  domain  along  the  sub-characteristics  x = constant. 
Solutions  to  the  diffusion  equation  are  known  to  be  infinitely 
differentiable  except  at  the  boundaries  but  (7)  does  not  have 
that  property.  We  are  therefore  led  to  the  need  for  "interior 
layers"  to  correct  the  outer  solution  in  the  neighborhood  of  sub- 
characteristics. In  general  there  can  be  many  points  at  which 
<j>  (x)  or  its  derivatives  up  to  a given  order  do  not  exist.  Our 
analysis  will  assume  only  one  such  point,  namely  x = xQ,  but 
the  results  are  easy  to  extend  to  the  more  general  situation 
because  of  the  linearity  of  solutions  of  (1). 

There  is  another  motive  for  studying  the  effect  of  discon- 
tinuities in  the  initial  data  for  the  Cauchy  problem  which  is  really 
the  more  important  reason.  In  Section  8 we  shall  show  how  m'ixed 
boundary- initial  value  problems  can  be  transformed  into  Cauchy 
type  problems  with  discontinuous  data.  The  results  we  develop 
now  will  thus  be  directly  applicable  to  that  case  and  will  lead 
to  a proper  understanding  of  the  influence  of  the  boundary  data 
on  the  solution. 
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To  proceed  with  an  analysis  of  the  Cauchy  problem  we 
assume  that  is  some  neighborhood  (x0  - h,  xQ  + h)  $ (x)  satisfies 
jump  conditions  in  the  form 


(8) 


*00 


2n 

L 

k=0 

(ak/k!) 

(X  - x0)  ♦ R2n 

for  x > 0 

2n 

I 

k=0 

(bkA!) 

(x  - x0)  + R‘n 

for  x < 0 

where  the  Taylor  remainders  Rn  and  R*  are  0(x^n+1)  as  x 0. 

To  understand  the  behavior  of  u(x,t)  near  to  the  sub- 
characteristic x = 0 we  introduce  an  inner  variable  of  the  form 


x = (x  - x0)//7 


and  an  inner  solution 

(9)  U(x, t ) = U(/7  x + xQ,  t) 
which  is  assumed  to  have  an  asymptotic  form 

(10)  U(x,t)  = U0(x,t)  + 61(e)  Ux (x, t)  + 

where  <5^(e)  = o(l)  and 

^k+1  ( e ) = k 1 1 

Rewriting  equation  (1)  in  terms  of  x gives  the  equation  for 

U as 

(U)  Ut  = eC^l2  ua 

Substitution  of  (10)  into  (11)  leads  to  the  following  equation 

for  U0 

(12)  (Uo)t  - (U0)« 

Initial  conditions  for  U(x,0)  follow  from  those  for  u(x,t): 

U(x,0)  = u(/Tx,0)  = <J>  (/Tx) 


then  from  (8) 
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(13) 


U(x,o)  = 


2n 


-~,k 


I (av/k!)  (/Tx)K  + R2n  for  x > 0 


k=0 


2n 


z (bt/k!)  (/lx)  + R^n  for  x < 0 


k=0 


This  indicates  that  the  proper  choice  for  the  asymptotic 
sequence  in  6k(e)  = (/"i")*  and  the  asymptotic  series  for  U is 

U(x,t)  = UQ(x,t)  + /7U1  (x,t)  + eU2(x,t)  + 

Substituting  this  expression  into  (11)  and  comparing  with 
(13)  we  obtain  equations  and  initial  conditions  for  Uk: 

(Vt  - (ukhx 


Uk(x,0)  = 


(a^/k! ) xK  for  x > 


~k 


(bj,/k!)  x for  x < 0 
Solving  for  Uk  in  terms  of  Hk  and  Hk*  we  have: 


and 

(14) 


Uk(x,t)  = ak  Hk(x,t)  + bk  HjJ  (x,t) 


2n  k 

U(x,t)  = z (/7)  [ak  Hk(x,t)  + bk  H*  (x,t)] 
k=0 


* 0 ((/T)2ntlj 

Relating  u and  U by  (9)  we  then  setx  = x - xQ  and  obtain 
u(x,t)  = U (x//l,  t) 

2n  k 

(15)  = : E (/T“)K  [ak  Hk(x//7,  t)  + bk  H*  (x//7,t)j  + 

+ 0 (/l) 2n+1)  as  e -*■  0. 

This  representation  has  been  derived  only  in  a formal  way 
but  it  can  be  rigorously  justified  for  | x - xQ J < h.  We  now  wish 
to  indicate  how  (15)  can  be  written  in  the  form  of  the  "outer 
solution"  (7)  plus  a correction  term  (which  we  call  an  "interior 
layer")  to  account  for  the  discontinuities  in  <j>  (x)  or  its  derivatives 
at  x = xQ.  Letting  dk  = ak  - bk  (15)  can  be  rewritten 
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LU  , 

u(x,t)  = z (/T)k  av[H,  (x//7,  t)  + H£  (x//7,t)] 
k=0  K K 

- t (/7)k  dkH?  {i./'Ti,  t)  + 0((/T)2nn) 
k*0  K K 

211  V 

= z (/7)k  (ak/k!)  vk(x//7,  t) 
k=0  K K 

I dkHf  (x//T,  t)  * 0((/7)2ntl) 
k=0  K K 


But  vv(x//7,  t)  is  a solution  to  u+  = eu  and  sirailar  to  (6)  we 
have  K t xx 


vk(x//7,  t) 


hence 


u(x,t)  = Z (/T)KCak/k!)  ( I 
k=0  i = 0 


7)V 


-I  (>r7)k  dkH{*  C*//T.  t)  ♦ 0 ((/I)2"*1) 
k-0  K K 

Because  k <_  2n  implies  k'  <_n,  k*  can  be  replaced  by  n.  Then 
reversing  the  order  of  the  summation  gives 


u(x,t)  = Z Ctt)  t1  (eL)1  [ Z (ak/k!)  xK] 
i*0  k«0  K 

2 xi 

- t (/7)k  dk  (St//7,  t)  * 0 ((/7)2n*'1) 

k-0  K K 

n • , 2n  v 

- Z (1/i!)  (et)V  [ Z (ak/k!)  xK] 
i=0  k»0  K 

2n 

- Z (>/T)k  dk  Hj>  (x/V7,  t)  + 0((v'7)2nn) 

k=0  K K 

The  expansion  (13)  of  the  initial  value  function  can  be 
used  to  simplify  this  for  x > xQ 
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(16)  u(x,t)  = I tgt)1  - 2?  dk  (/7)k  H£  (i//T,  t) 

i-0  l-  k=0  K K 

+ 0 ((/"e)^n+^  where  x = x - x0 

By  similar  derivation  the  corresponding  formula  for  x < xQ 
can  be  obtained: 

(17)  u(x,t)  = Z (Et)1y-^(x]^  + 2S  dk(/T)k  Hk  (x//7,t) 

i=0  lm  k=0  K K 

+ 0 ( (vre)2n+1) 

In  this  form  it  is  apparent  how  the  inner  solution  given 
by  (16)  for  x > xQ  and  by  (17)  for  x < xQ  is  just  the  outer  solution 
plus  a correction  for  the  interior  layer.  An  analysis  of  the 
functions  Hn  and  H*  would  show  that  Hn(x,t)  becomes  neglible  faster 
than  any  power  of  (1/x)  as  x + -">  and  similarly  H*(x,t)  becomes 
neglible  as  x +°°.  The  correction  terms  in  (16)  and  (17)  are 
therefore  neglible  except  in  a neighborhood  of  the  subcharacteristic 

x " V ^ 

6.  Solution  for  Piecewise  Polynomial  Initial  Data.  In 
previous  sections  asymptotic  analysis  was  used  to  derive  approxi- 
mate  solutions  to  (1),  (2),  (3)  assuming  that  e was  small.  In 
this  section  we  show  how  an  explicit  solution  in  series  form  can 
be  obtained  for  the  same  problem,  regardless  of  the  magnitude  of 
e,  when  4>  (x)  is  a piecewise  polynomial  (with  no  continuity  required 
between  the  pieces).  Any  initial  value  function  can  be  approxi- 
mated, as  accurately  as  necessary  by  such  functions  and  the  maximum 
principle  for  parabolic  equations  guarantees  that  no  greater  error 
is  thereby  introduced  into  the  solution.  If  et  is  small  the  solu- 
tion can  be  considerably  simplified  by  dropping  insignificant  terms 
and  in  the  limit  we  obtain  the  same  asymptotic  representations  as 
before  - (7),  (16)  and  (17). 

The  solution  will  be  obtained  by  writing  the  function  <J>  (x) 
as  a combination  of  the  functions  hn  and  h*  and  then  expressing 
u(x,t)  as  the  same  combination  of  Hn  and  H|[.  We  shall  suppose 

there  are  points  x^  such  that  -°°  _< < x_2  < x_^  < xQ  < x^  < 

_<  « and  polynomials  p^(x)  with 

*00  = Pi 00  in  [xi,  xi+1). 

The  closed  bracket  is  a convenience  here  since  the  value  of  $ at 
any  one  point  is  immaterial.  The  index  i is  assumed  to  vary  over 
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some  index  set  I = J u K where  J = {1,2,...}  and  K = {0,-l,-2,...}. 
These  sets  may  be  finite  or  infinite  but  are  non-void  and  we  assume 
that  only  finitely  many  x^  are  contained  in  any  bounded  interval. 

For  k > 0,  let 

n(k)  = max  {degree  (pi)  : i = 0 , 1, ,k} 


n(-k)  = max  {degree  CPi 3 : 


The  representation  of  <(»  (x)  has  jumps  at  the  points  x^  so 
we  define 

di  ="iTT  " Pi-l(xi^  £or  xi  ^ ” 

Let  the  coefficients  of  the  polynomial  p0(x)  be  denoted  by 
a^;  that  is 


n(o) 

E 

i-0 


P0(x)  - aA  x 


(18) 


The  initial  values  can  now  be  expressed 
n(o)  . n(j) 

f(x)  = E a-x1  + E ( E d1}  h (x  - x.)) 
i=o  jej  n=0  J n J 


n(k) 

- E { 2 
keK  n=0 


d?  h* 
k n 


(x  - xk)) 


This  series  is  well  defined  since  for  any  given  x only  a fi- 
nite number  of  terms  are  non-zero.  (Recall  hn  (x  - x-)  = 0 if 
x < Xj  and  h£  (x  - x^)  = 0 if  x > x^.)  ^ 


The  functions  Hn(x  - Xj,Et)  and  H£(x  -xk,et)  satisfy 
equation  (1)  and  have  the  initial  values  hn(x  - x.;)  and  h*  (x  -xk) 
respectively.  Using  the  linearity  of  the  solutions  of  (lj  we 
may  therefore  write  a formal  expression  for  u(x,t) 

n(o)  n(j) 

(19)  u(x,t)  = E o.  v.  (x,et)  + E EdV  H (x-x- ,et) 

i-0  11  jej  n=0  J n J 


- E 

keK 


nOO  n 

dk  HX  (x  ■ xk>et3 


When  J and  K are  finite  this  representation  is  not  merely 
formal  but  actually  solves  (1),  (2),  (3).  When  J or  K is  infinite 
the  representation  is  still  the  real  solution  of  the  problem  under 
certain  general  growth  conditions  on  the  polynomials  p^(x).  For 
this  purpose  we  make  the  following  definition: 
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The  family  of  functions  {pj_ (x) : ie  I * J u K}  is  said  to 
be  outward  bounded  by  f (x)  with  respect  to  the  points  {x^:  i€I=JUK} 
if 

|Pj  00  I J5  f(x)  for  all  x > Xj 

|pk  Cx)|  < f(x)  for  all  X < xk+1 

Theorem:  The  formal  series  (19)  converges  for  0<t<l/4ea 

to  the  solution  of  (1),  (2)  (3)  when  $(x)  is  given  by  (18)  provided 
{Pi(x):  i€I  = JUK)  is  outward  bounded  by  2M  exp [ax2]  with  respect 
to  the  points  '{x^:  iei}  for  some  M,  a >.  0. 

Proof:  Omitted.  The  argument  is  fairly  straightforward 

using  the  integral  representation  of  u(x,t)  and  the  properties 
of  the  fundamental  solution  F(x,t). 

7.  Asymptotic  Analysis.  The  solution  (19)  of  problem 
(1),  (2),  (3)  in  general  will  contain  an  infinite  number  of 
terms.  However  most  of  these  are  neglibible  even  when  et  is  not 
small  provided  x - Xj  and  x - xk  are  large  enough. 

This  becomes  apparent  from  studying  the  asymptotic  behavior 
of  Hn  and  H$  which  can  be  summarized  as  follows: 

a. )  for  fixed  x > 0 H5(x,et)*0  faster  than  any  power 

of  et. 

b. )  for  fixed  x < 0 Hn(x,et)+Q  faster  than  any  power 

of  et . 

c. )  for  x > 0 and  x _<  0 (/7t ) H*(x,et)  is  0 (/etn) . 

d. )  for  x < 0 and  x _<  0 (/"et ) Hn(x,et)  is  0(/7tn). 

In  the  representation  (19)  of  u(x,t)  we  assume  without  loss 
of  generality  that  x0  _<x  < xj  (otherwise  the  indexes  can  be  shifted). 
It  follows  that  x ~ Xj  <0  for  jej  and  x - xk  >_  0 for  k£K  with 

equality  only  possible  for  kB0.  Using  a.)  - d7)  we  may  conclude 

that  terms  Hn(x  -Xj  , et)  and  Hi  (x  - xk,et)  are  neglible  as  et+0 
unless  x - x*  and  x - x.  are  o (/7t) . Therefore  we  define  the 
sets  J and  it.  by  * 

t t 

Je  - (j€J:  x - xj  - 0(/et)} 

Ke  - (k€K:  x - xk  - 0(/et) } 

These  sets  must  be  finite  since  only  finitely  many  x^  and  xk  are 
contained  in  any  bounded  interval.  For  asymptotic  approximations 
to  U(kft)  as  c ^ 0 we  can  restrict  the  index  sets  J and  K to  J 
and  K . Furthermore,  for  N-th  order  accuracy  the  terms  H ande 
H*  can  be  neglected  if  n > N,  because  of  c.)  and  d.).  The  repre- 
sentation (19)  can  therefore  be  written  as 
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(20)  u(x,t)  - Z a.  v.  (x,et)  + Z Z d1?  H (x-x^et) 

i-0  jej  n=0  J n J 

£ 


- r 1 K H£  Cx-xk,et)  + Rn 

k€K  n=0  K 11  K N 

£ 

where  the  remainder  =•  0(/Tt)N+1  as  et  0. 

The  sets  J and  K are  monotone  decreasing  with  respect  to 
e (in  the  set  theoretic  fense)  and  eventually  reduce  to  at  most 
one  element.  The  resulting  form  of  (20)  is  then  equivalent  to 
the  inner  solution  representations  (16),  for  x - x,  = 0{/"et),  or 
(17),  for  x - xQ  = 0(/Tt)  and  (7)  otherwise. 

8.  Mixed  Boundary  - Initial  Value  Problems.  One  of  the 
techniques  for  solving  mixed  boundary  - initial  value  problems 
is  to  convert  the  problem  into  an  appropriate  pure  initial  value 
problem  for  which  the  fundamental  solution  is  known.  The  solution 
of  the  new  problem  is  then  shown  to  satisfy  all  of  conditions  of 
the  original  problem.  This  approach  is  particularly  useful  in 
the  present  case  since  we  have  developed  all  of  the  necessary 
tools  for  solving  the  Cauchy  problem. 

Unfortunately  there  does  not  seem  to  be  any  standard  re- 
ference for  this  technique  and  we  do  not  have  space  here  for 
discussing  the  matter.  Therefore  we  shall  only  indicate  the 
proper  initial  value  problem  which  can  be  used  to  solve  certain 
mixed  boundary  - initial  value  problems.  Greater  detail  will  be 
included  in  [5]  and  is  being  submitted  for  publication  in  a 
separate  paper. 

The  following  list  is  valid  for  the  equation  u.  = eu  . 

w A A 

A.  Semi-infinite  domain,  zero  boundary  condition 


Mixed  BVP-IVP:  u(x,0)  = *(x)  x > 0 

u(o,t)  = 0 t > 0 

Corresponding  IVP : u(x,o)  = <j>  (x ) x > 0 

u(x,o)  = -<£(-x)  x<0 

B.  Semi-infinite  domain,  zero  initial  condition 

Mixed  BVP-IVP:  u(x,o)  = 0 x > 0 

u(o,t)  = hn(t)  t > 0 


Corresponding  IVP:  u(x,o)  = 2h^n(x//T) 
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c. 


Finite  domain,  zero  boundary  conditions 

Mixed  BVP-IVP:  u(x,o)  = hn(x) 

u(o,t)  = 0 

u(x0,t)  = 0 

Corresponding  IVP:  u(x,o)  = hn(x-2k  xQ)  - 

for  (2k-l)  xQ 


t > 0 
t > 0 

h*(x-2k  x0) 

< x < (2k+l)xQ 


D.  Finite  domain,  one  non-zero  boundary  condition 


Mixed  BVP-IVP:  u(x,o) 

u(o,t) 

u(x0,t) 

Corresponding  IVP:  u(x,o) 


o 0 < x < x( 

hn(t)  t > 0 

0 t > 0 

00  ,x+2ix0 

2 ,-n  ^ 

j = 0 / e 


- 2 


00 

Z h2n 
k=0 


x-2k  x. 


(- 


/I 


These  correspondences  combined  with  the  theory  developed  in 
previous  sections  for  the  Cauchy  problem  allow  us  to  solve  the 
heat  conduction  problem  in  a finite  rod  explicitly  when  the  data 
is  given  in  piecewise  polynomial  form. 

8.  Conclusions.  Basically  what  we  have  developed  in  this 
discussion  is  an  alternative  to  the  Fourier  method  for  solving  the 
diffusion  equation.  The  present  method,  based  on  asymptotic  analysis, 
is  accurate  for  small  et  and  is  particularly  advantageous  when 
any  sort  of  discontinuities  are  present  in  the  initial  and  boundary 
data.  In  general  very  few  terms  are  needed  to  give  accurate 
estimates  of  the  solution.  To  contrast  this  with  the  Fourier  approach 
one  should  note  how  many  terms  are  required  to  resolve  a discontinuous 
function  into  sines  and  cosines  with  reasonable  accuracy.  Although 
the  higher  frequency  modes  tend  to  cancel  out  and  become  insigni- 
ficant with  time  this  is  not  helpful  in  describing  the  early  develop- 
ment of  the  solution. 


In  our  opinion  the  present  method  is  a far  more  natural  approach 
to  diffusion  problems  when  steep  gradients  are  present.  It  also 
has  the  advantage  of  providing  formulae  rather  than  numbers  for  des- 
cribing the  physical  behavior. 
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SAMPLE  SIZES  FOR  MISSILE 
IN  FLIGHT  RELIABILITY  DETERMINATION 

Edward  F.  Southvorth 

Army  Missile  Test  and  Evaluation  Directorate  (AJRMTE) 
White  Sands  Missile  Range,  New  Mexico  88002 


ABSTRACT.  This  paper  presents  an  unusual  approach  to  determining 
sample  sizes  required  to  adequately  measure  reliability.  This  approach 
has  been  tailored  to  the  specific  problem  of  measuring  the  in  flight 
reliability  of  guided  missiles.  It  should  also  have  application  to  other 
types  of  problems.  The  paper  .includes  the  rationale  used  to  develop  a 
measure  of  the  probability  of  a successful  reliability  test.  It  then  de- 
rives equations  for  calculating  the  probability  of  a successful  relia- 
bility test.  Finally  the  paper  presents  plots  of  the  probability  of  a 
successful  reliability  test  as  a function  of  sample  size  for  a vide  range 
of  reliability  testing  goals. 

1.  INTRODUCTION.  As  the  name  suggests,  the  primary  function  of  the 
Army  Missile  Test  and  Evaluation  Directorate  (ARMTE)  is  test  and  evalua- 
tion of  Army  guided  missile  systems.  A small,  but  important,  part  of 
this  function  is  the  determination  of  the  reliability  of  these  systems. 

Of  particular  concern  is  the  determination  of  the  in-flight  reliability 
of  the  missiles  themselves.  There  are  three  reasons  for  this  concern. 

a.  First,  the  unit  cost  of  most  guided  missiles  is  very  high.  In 
addition,  missiles  are  produced  in  large  quantity.  Many  more  missiles 
are  produced  than  other  portions  of  most  missile  systems. for  the  same 
reason  that  many  more  rounds  of  ammunition  are  produced  than  the  guns 
that  fire  them.  Initiating  large  scale  production  of  a specific  missile 
configuration  requires  a major  decision.  Naturally,  the  decision  makers 
want  assurnace  that  missile  performance,  including  reliability,  will  be 
satisfactory  before  committing  large  sums  to  missile  production. 

b.  Second,  most  modern  missiles  are  configured  as  certified  rounds. 
The  missiles  are  sealed  at  the  factory.  They  are  not  designed  to  be 
maintained  in  the  field.  The  field  army  is  neither  trained,  equipped, 
nor  authorized  to  maintain  them.  Any  problems  discovered  after  they  are 
manufactured  can  only  be  corrected  by  very  expensive  retrofit  programs. 

c.  Third,  there  is  a lot  more  than  money  involved.  Mistaken  esti- 
mates of  missile  performance  contribute  to  mistakes  in  the  tactical  doc- 
trine developed  for  optimum  use  of  our  weapon  systems.  Large  discrepan- 
cies in  our  estimate  of  in  flight  reliability  could  lead  to  serious  mis- 
calculation of  our  defense  posture.  Such  miscalculations  can  be  the 
cause  of  tactical  blunders. 
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Since  measuring  missile  in  flight  reliability  is  so  important,  why 
not  just  measure  it?  The  answer,  of  course,  is  that  missile  in  flight 
reliability  can  only  be  measured  in  very  expensive  missile  firing  tests. 
These  tests  are  costly  because  of  the  high  cost  of  the  missiles  which 
must  be  expended,  because  of  the  high  cost  of  the  range  support  required, 
and  because  of  the  high  cost  of  the  targets  needed  for  many  of  these 
tests. 

It  is  obvious  that  an  adequate  measure  of  missile  in  flight  reliabi- 
lity must  be  obtained  with  a minimum  number  of  firings.  The  situation 
justifies  more  complex  test  planning  techniques  than  are  usually  employed 
in  designing  reliability  experiments. 

There  is  one  other  unusual  feature  about  this  problem.  ARMTE  is  re- 
sponsible for  measuring  missile  in  flight  reliability,  but  ARMTE  is  not 
responsible  for  making  any  decisions  on  the  adequacy  of  this  reliability. 
ARMTE  is  not  involved  in  determining  acceptance  or  rejection  criteria, 
buyers'  risks,  or  sellers'  risks,  lor  this  reason,  the  standard  opera- 
ting characteristics  curves  have  limited  value. 

This  paper  will  first  discuss  ARMTE 's  reliability  testing  goals.  It 
will  then  define  a measure  of  the  effectiveness  of  test  designs  in  achiev 
ing  these  reliability  testing  goals.  Finally,  it  will  present  families 
of  graphs  which  plot  the  probability  of  a successful  reliability  test  as 
a function  of  sample  size  for  a wide  variety  of  reliability  testing  goals 

2.  RELIABILITY  TESTING  GOALS.  Tne  Army  Missile  Test  and  Evaluation 
Directorate  has  three  types  of  missile  in  flight  reliability  testing 
goals.  Hie  first  goal  is  to  demonstrate  that  reliability  is  satisfactory 
If  we  can  accomplish  that,  all  parties  are  happy  and  the  Army  obtains  re- 
liable missiles.  To  do  this  we  must  produce  convincing  evidence  that 
reliability  equals  or  exceeds  whatever  level  is  accepted  as  satisfactory. 

The  second  goal  is  the  opposite  of  the  first.  If  we  are  unable  to 
demonstrate  that  reliability  is  satisfactory,  the  possibility  that  re- 
liability falls  short  of  the  satisfactory  level  must  be  investigated. 

The  third  goal  is  addressed  if  we  cannot  accomplish  either  of  the 
first  two  goals.  If  we  cannot  demonstrate  that  reliability  is  satisfac- 
tory, and  we  also  fail  to  demonstrate  that  reliability  is  not  satisfac- 
tory, then  our  goal  is  to  demonstrate  that  reliability  is  near  the  satis- 
factory level.  Our  ability  to  accomplish  this  goal  is  dependent  upon  the 
effectiveness  of  the  test  design  used  to  accomplish  the  first  two  goals. 
If  the  test  design  provides  a high  probability  of  detecting  small  diffe- 
rences between  actual  reliability  and  the  satisfactory  level,  it  is  also 
capable  of  demonstrating  that  reliability  is  very  near  the  satisfactory 
level.  In  such  a test,  failure  to  demonstrate  that  reliability  is  either 
satisfactory  or  unsatisfactory  provides  high  confidence  that  it  is  near 
the  satisfactory  level.  On  the  other  hand,  test  designs  which  can  detect 
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only  large  variations  from  the  satisfactory  reliability  level  can  only 
define  the  value  of  reliability  within  broad  bands  of  uncertainty. 

3.  A MEASURE  OF  TEST  DESIGN  EFFECTIVENESS.  Nov  that  ARMTE's  re- 
liability testing  goals  have  been  described,  a measure  of  the  effective- 
ness of  test  designs  in  achieving  these  goals  will  be  defined.  A test 
which  achieves  ARMTE's  reliability  testing  goals  must  satisfy  two  criteria. 
First,  it  must  produce  a useful  conclusion  about  reliability.  The  con- 
clusion which  the  test  produces  can  satisfy  any  one  of  the  three  relia- 
bility testing  goals  described  in  paragraph  2,  above,  to  be  useful.  Se- 
cond, the  conclusion  the  test  produces  must  be  a correct  conclusion. 

Based  upon  these  criteria,  the  probability  of  a successful  ARMTE  relia- 
bility test  is  defined  as  the  probability  that  the  test  produces  a con- 
clusion which  satisfies  one  of  ARMTE's  reliability  testing  goals,  multi- 
plied by  the  conditional  probability  that  the  conclusion  is  correct. 

k.  DETERMINING  THAT  THE  CONCLUSION  IS  CORRECT.  A major  part  in  pre- 
dicting the  probability  of  a successful  ARMTE  reliability  test  is  deter- 
mining the  probability  that  a conclusion  about  reliability  is  correct. 

A good  measure  of  the  correctness  of  a conclusion  about  reliability  is 
the  statistical  confidence  which  can  be  placed  in  that  conclusion.  Given 
the  results  of  a reliability  test  in  which  there  were  s successes  in  n 
trials,  it  is  easy  to  calculate  the  statistical  confidence  that  actual 
reliability  is  greater  than  or  equal  to  the  satisfactory  level.  It  is 
also  easy  to  calculate  the  statistical  confidence  that  actual  reliability 
is  less  than  the  satisfactory  level.  - Statistical  confidence  is  calculated 
using  the  binomial  distribution.  The  equations  for  calculating  statisti- 
cal confidence  using  the  binomial  distribution  are  well  known.  However, 
ready  reference  to  these  equations  is  needed  for  the  further  development 
of  this  paper.  Therefore,  these  equations,  and  their  rationale  will  be 
briefly  reviewed. 


Figure  1 shews  the  equations  used  in  calculating  confidence  and  pro- 
vides some  examples  which  explain  the  significance  of  the  confidence 
values  calculated. 


Equation  1 is  merely  a statement  of  the  most  basic  equation  of  the 
binomial  distribution.  It  states  that  the  probability  of  exactly  "i" 
successes  in  "n"  trials,  if  the  probability  of  success  in  each  trail  is 
"p",  equals  n factorial,  times  p to  the  power,  times  (l  - p)  to  the 
n - i“*  power,  divided  by  i factorial,  and  also  divided  by  (n  - i)  fac- 
torial. 0 factorial  is  defined  as  equal  to  1. 


The  second  column  under  examples  shows  the  probability  of  achieving 
various  numbers  of  successes  in  25  trials  if  the  probability  of  success 
in  each  trial  equals  .85.  For  example  the  probability  of  achieving 
exactly  21  successes  under  these  conditions  equals  .2110. 
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FIGURE  1 


EQUATIONS : 

1.  P(i,  n,  p)  = — — where  01  = 1 


, i...  .n  - i 

n!  P (1  - p) 

I!  (n  - i’)! 


_D 


3.  C(r>Pc)  - 1 - f~  P(i,  n,  Pc)  - 1 - --rr 


i 


nl  PeJ  (l  - Pc) 
(n  - i)  I 


n - i 


n 


4‘  C<*<pc>  - r^TT-l  *>'Pc  > * '~YI 


n!  PcVl  ~ Pc)n  " j 


(n  - i) : 


EXAMPLES: 


CAV^rLiPO  ; 

*H 

! 

C 

✓*N 

/*N 

o 

a 

1 

/"■v 

Pi 

>N 

i 

c 

m 

o 

1 

l 

■U 

H 

GO 

Pm 

cc 

si 

m l 

*rf 

* 

* 

• 

U3 

a 

co  a 

!-  • 

'/-v 

*n 

Cu 

•H 

A! 

V 

*H  ■ 

f t 

CO 

i.  P~ 

! *H 

,X2 

*>-  « 

Pi 

Pi  - 

tu 

in 

a.  i 

i 

CC  t/’i 

w 

}*H 

C 

4) 

i™''* 

: rJ 

to 

4' 

' ti 

/■ 

jw 

u 

Pm 

p 

o 

u 

n to 

CO  rJ 

4-! 

M *H  < 
i 

>>  - • I 

0 # 

i*H 

o • 

O' 

p<  i 

fe 

*N 

nj 

£ 

J 

cd 

.c 

H p 

1 

✓.II 

V-H 

<U 

(\i 

n 

*.  'i 

Cj 

u c \ 

p- 

P 

I 

<u 

*H  f 

P> 

#* 

V 

iH 

u 

1! 

a p 

T-H  v" 

rs 

jj 

S3 

w 

•H  f 

4-t  H 

0) 

✓"V 

ai 

CJ 

th3 

Vm 

/J 

* 

i'.i 

’H 

**0. 

CJ 

rTJ 

• Pi 

?5  in 

c;  [ 

iH 

'■w' 

♦H 

Pm 

*H 

V 

4J 

CN 

t 

ro 

Pm 

ni 

P 

ii 

O *H 

E 

1 

*co 

a 

p 

r-J 

iH 

,1  p 

P v 

D 

P A « 

I/Vh 

0 

V .t' 

0 

0 

FT  *r< 

P<  Pm 

cj 

o 

U 

CJ 

CJ 

11. 

2.6 

.0172 

.0172 

CO 

ev 

cc 

Ox 

• 

. 0000 . 

1 . 00 

?A 

.0759 

.0931- 

.9069 

.0172 

.96 

23 

.1607 

.2537 

.7463 

.0951 

.92 

22 

.2174 

.4711 

.5289 

.2537 

.<88 

21 

.2110 

.6821 

.3179 

.4711 

.84 

20 

.1564 

.8385 

.1615 

. 6821 

.80 

19 

.0920 

.9305 

.0595 

.8385 

.76 

18 

.0441 

.9745 

.0255 

.9305 

.72 

17 

.0175 

.9520 

.0080 

.9745 

.68 

16 

.0053 

.9979 

.0021 

.9920 

. 64 

15 

.0016 

.9995 

.0005 

.9979 

.60 

)A 

.0004 

.9999 

.0001 

.9995 

.56 

13 

.0001 

1.0000 

.0000 

.9999 

.52 

12 

.0000 

1.0000 

.0000 

1.0000 

.48 

562 


Eq.uati.on  2 is  basic  also.  It  is  a statement  of  the  cumulative  pro- 
bability from  the  binomial  distribution.  It  shows  that  the  probability 
of  s or  more  successes  in  a trials,  if  the  probability  of  successes  in 
each  trial  equals  o,  is  equal  to  the  sum  of  the  probability  of  s successes 
in  a trials,  plus  the  probability  of  s + 1 successes  in  n trials,  and  so 
on  until  the  probaoility  of  n successes  in  n trials  has  oeen  included. 

The  third  column  under  examples  shows  the  cumulative  probability  of 
s or  more  successes  in  2Lj  trials,  if  the  probability  of  success  in  each 
trial  equals  .6b,  for  various  values  of  s.  Each  cumulative  probability 
in  column  3 equals  the  sum  of  the  individual  probabilities  in  column  2 
associated  with  that  number  or  more  successes.  lor  example,  the  pro- 
bability of  21  or  more  successes  under  these  conditions  is  .6821.  This 
is  the  sum  of  the  probability  of  exactly  21  successes,  plus  the  proba- 
bility of  exactly  22  successes,  and  so  on  until  the  probability  of  2? 
successes  in  25  trials  has  been  added. 


Equation  3 requires  a little  explanation.  It  is  used  to  calculate 
confidence  that  reliability  equals  or  exceeds  some  criterion.  In  this 
case,  the  criterion  is  the  satisfactory  level  for  in  flight  reliability. 
The  far  left  terra  in  the  equation  "C  (ri(pc)  ' means  "confidence  that 
reliability  equals  er  exceeds  criterion".  The  symbol  "pc"  designates 
the  probability  cf  success  in  each  trial  which  satisfies  the  criterion. 

It  is  interesting  to  note  that  the  center  and  right  terms  of  equation  3 
are  the  complements  of  the  corresponding  terms  of  equation  2.  Equation 

rau- 
Equa- 


2 defines  the  probability  that  s or  more  successes  would  occur  in 
dom  trials  if  the  probability  cf  success  in  each  trial  equals  "p", 
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This  confidence, 


as  shown,  equals  unity  minus  the  probability  that  the  result  would  occur 
from  random  sampling.  Equation  3 is  used  to  calculate  conf idence  'that  a 
given  test  result  demonstrates  that  reliability  equals  or  exceeds  a satis- 
factory level. 


Column  4,  under  examples,  shows  the  confidence  that  reliability  equals 
or  exceeds  .85  if  we  experience  s successes  in  25  trials.  As  you  would  ex- 
pect, each  value  in  column  k equals  unity  minus  its  corresponding  value  in 
column  3* 

Equation  4 is  used  to  calculate  confidence  that  reliability  is  less 
than  the  criterion.  It  is  very  similar  to  equation  2.  The  only  difference 
is  that  "i"  is  summed  from  s + 1 to  n in  equation  h whereas  "i"  is  summed 
from  "s"  to  n in  equation  2.  Equation  2 is  used  to  determine  the  proba- 
bility of  s or  more  successes  in  n trials  if  the  probability  of  success  in 
each  trial  equals  p.  The  principal  of  equation  4 is  that  this  same  pro- 
bability is  the  confidence  that  reliability  is  less  than  p if  we  exper- 
ience less  tnan  s successes  in  n trials.  This  can  most  easily  be  illustra- 
ted by  example.  Column  3 under  examples  shows  that  the  probability  of  21 
or  more  successes  in  25  trials  if  p equals  .85  is  .6821.  Therefore  if  we 
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experience  fewer  than  21  successes  in  25  trials  we  can  have  a confidence 
of  .6821  that  p is  less  than  .85.  Column  5 under  examples  shows  this  con- 
fidence value  for  the  case  in  which  we  experience  only  20  successes  in  25 
trials.  In  fact  every  entry  in  column  5 is  numerically  equal  to  the  next 
higher  entry  in  column  3* 

There  is  one  more  point  that  I would  like  to  discuss  before  leaving 
Figure  1.  The  far  right  column  under  examples  shows  the  point  estimate 
for  reliability  for  each  number  of  successes  in  25  trials.  As  you  can 
see,  the  point  estimate  is  merely  the  numerical  equivalent  of  the  number 
of  successes  divided  by  the  number  of  trials.  It  is  important  to  note 
that  whenever  the  point  estimate  exceeds  the  criterion  for  reliability 
then  confidence  that  reliability  exceeds  criterion  is  greater  than  con- 
fidence that  reliability  is  below  the  criterion.  In  the  examples,  the 
criterion  is  .85.  When  22  successes  are  obtained  in  25  trials  the  point 
estimate  is  .88.  In  this  case  confidence  that  reliability  exceeds  .85 
is  shown  as  .5289.  Confidence  that  reliability  is  below  .85  equals  only 
,2537*  On  the  other  hand,  if  there  are  only  21  successes  in  25  trials 
the  point  estimate  is  .84  which  is  below  the  criterion  of  .85.  Note  that 
when  the  point  estimate  is  below  the  criterion  there  is  greater  confi- 
dence that  reliability  is  below  the  criterion  than  the  confidence  that 
it  is  above.  This  example  illustrates  a rule  that  applies  to  all  cases. 

5.  PROBABILITY  OF  A SUCCESSFUL  TEST:  The  first  four  equations  on 

Figure  2 are  merely  repetitions  of  the  four  equations  just  discussed  on 
Figure  1.  They  are  repeated  to  lend  continuity  to  this  discussion.  They 
provide  an  important  link  from  the  familiar  to  the  unfamiliar  represented 
by  equation  5*  Equation  5 i&  the  first  unusual  feature  to  the  approach 
presented  in  this  paper. 

The  left  side  of  equation  5>  "Pst  (n,  r,  pc)",  means  "probability  of 
a successful  test"  if  the  sample  size  equals  "n",  the  actual  reliability 
equals  "r"  and  the  probability  of  success  in  each  trial  which  satisfies 
the  criterion  equals  "pc".  The  right  side  of  the  equation  is  a double 
summation.  It's  similar  to  a double  integral  except  that  it  applies  to 
sample  sizes  which  can  only  be  whole  numbers  rather  than  continuous 
variables.  Equation  5 is  the  method  used  to  predict  the  probability  of 
a reliability  test  which  successfully  accomplishes  our  first  goal  of 
demonstrating  that  actual  reliability  equals  or  exceeds  the  satisfactory 
level.  In  other  words,  it  predicts  the  probability  that  we  can  success- 
fully demonstrate  that  "r"  equals  or  exceeds  "Pc". 

The  right  side  of  equation  5 is  divided  into  two  major  parts.  The 
part  which  immediately  follows  the  first  summation  sign  calculates  the 
probability  of  achieving  exactly  s successes  in  n trials  if  the  proba- 
bility of  success  in  each  trial  (i.  e.  reliability)  equals  "r".  You  can 
see  that  this  part  of  the  equation  is  in  the  same  form  as  equation  1 which 
is  used  for  the  same  type  of  calculation.  The  last  part  of  equation  5 
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calculates  confidence  that  reliability  equals  or  exceeds  the  criterion, 
"pc",  if  we  experience  exactly  "s"  successes  in  "n"  trials.  You  can  see 
that  this  part  of  the  equation  is  in  the  same  form  as  equation  3 which 
is  used  for  the  same  type  of  calculation.  The  two  parts,  together,  equal 
the  probability  of  achieving  "s"  successes  in  "n"  trials,  multiplied  by 
the  confidence  provided  by  "s"  successes  in  "n"  trials  that  reliability 
equals  or  exceeds  the  criterion,  "pc". 

The  remaining  portion  of  equation  5 is  the  first  summation  sign. 

This  summation  sign  shows  that  the  calculation  described  above  is  repeated 
for  different  values  of  "s"  and  that  the  results  are  added  to  obtain  the 
probability  of  a successful  test.  All  values  of  "s"  are  used  which  equal 
or  exceed  the  sample  size  multipled  by  the  criterion,  "p  Earlier  I 
pointed  out  that  all  values  of  "s"  which  equal  or  exceed  this  quantity 
contribute  to  our  confidence  that  reliability  is  satisfactory. 

FIGURE  2 


EQUATIONS 
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Figure  3 shows  one  of  the  consequences  of  this  relationship.  This 
Figure  lists  the  values  of  "s"  which  contribute  to  our  confidence  that 
reliability  is  satisfactory  for  various  numbers  of  samples  and  values  of 
criteria.  It  is  evident  that  there  are  more  values  of  "s"  which  contri- 
bute as  sample  size  is  increased  and  as  the  criterion  "pc ",  is  reduced. 

Each  condition  under  which  an  additional  value  of  "s"  is  introduced  into 
the  calculation  is  marked  by  a pip  on  Figure  3*  Each  time  an  additional 
value  of  "s"  is  introduced,  there  is  a discontinuity  in  the  graphs  which 
show  the  probability  of  a successful  test.  The  result  is  that  tnese  graphs 
have  a sawtooth  characteristic  to  them.  The  next  figure  will  demonstrate 
this  characteristic. 

Figure  4 shows  a family  of  graphs  which  plot  the  probability  of  a 
successful  test  as  a function  of  sample  size.  Each  graph  is  applicable 
to  the  goal  of  demonstrating  that  reliability  is  equal  to  or  greater  than 
•93.  The  bottom  graph  is  a plot  of  the  probability  of  achieving  this 
goal  if  the  actual  probability  of  success  in  each  trial  (P  ) equals  .95* 

In  other  words,  actual  reliability  equals  the  criterion.  The  other  graphs 
apply  to  cases  in  which  actual  reliability  exceeds  the  criterion.  Graphs 
are  provided  for  reliability  values  of  .96,  .97,  .pBand  .99* 

'There  are  several  features  of  these  graphs  which  require  explanation. 
First,  they  a re  plotted  as  though  they  are  continuous ‘functions.  This 
was  done  to  connect  together  successive  points  associated  with  the  same 
actual  probability  of  success  in  each  trial.  Obviously,  the  only  points 
on  the  graphs  that  have  any  meaning  are  those  associated  with  whole  num- 
bers corresponding  to  discrete  sample  sizes.  Second,  the  sawtooth  shape 
of  these  graphs  has  already  been  explained,  but  further  explanation  would 
probably  be  helpful  new  that  you  can  see  this  characteristic.  The  pheno- 
mena, loosely  referred  to  as  discontinuities,  occur  at  sample  sizes  which 
are  multiples  of  20  if  the  goal  is  to  demonstrate  that  reliability  is 
equal  to  or  greater  than  .95.  The  reason  is  that  the  probability  of 
achieving  19  or  more  successes  in  20  trials  is  far  greater  than  the  pro- 
bability of  achieving  ly  successes  in  19  trials.  Similarly,  the  proba- 
bility of  achieving  3 8 or  more  successes  in  ko  trials  is  greater  than  the 
probability  of  achieving  3S  or  more  successes  in  39  trials.  At  every 
multiple  of  20  samples  the  experiment  can  accomodate  one  additional  fail- 
ure without  driving  the  point  estimate  below  .95-  Hence  the  sawtoothed  shape 
of  these  graphs.  There  is  a third  feature  of  these  graphs  that  requires 
explanation.  The  graphs  show  that  there  are  conditions  in  which  the  pro- 
bability of  a successful  test  decreases  slightly  as  sample  size  increases 
slightly.  This  phenomenon  is  very  disturbing  to  most  people.  It  defies 
intuition.  However,  it  actually  happens.  If  the  actual  reliability  is 
greater  than  the  satisfactory  level,  the  long  term  trend  is  always  to  in- 
crease the  probability  of  a successful  test  with  large  increases  in  sam- 
ple size,  However,  the  sawtoothed  shape  of  these  graphs  causes  small 
intervals  where  the  graphs  have  negative  slopes  when  the  actual  relia- 
bility is  only  slightly  greater  than  the  satisfactory  level. 
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FIGURE  3 
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PROBABILITY  OF  A SUCCESSFUL  TEST 


GOAL:  DEMONSTRATE  THAT  RELIABILITY  > 


CESSFUL 


Figures  5 thru  11  are  similar  to  Figure  4 except  that  they  apply  to 
different  levels  of  satisfactory  reliability. 

They  cover  goals  of  demonstrating  that  reliability  is  equal  to  or 
better  than  .90,  .85,  .80,  .75>  *70,  .65,  and  .60. 

Figure  12  adds  a sixth  and  final  equation  to  the  group.  Equation  6 
is  the  counterpart  to  equation  5*  Equation  5 is  used  to  determine  the 
sample  size  required  to  achieve  our  first  goal  of  demonstrating  that  re- 
liability is  satisfactory.  Equation  6 is  used  to  determine  the  sample  size 
required  to  achieve  the  second  goal  of  demonstrating  that  reliability  is 
unsatisfactory.  The  right  side  of  equation  6 is  divided  into  two  major 
parts,  as  equation  5 was.  The  part  which  immediately  follows  the  first 
summation  sign  calculates  the  probability  of  achieving  exactly  s successes 
in  n trials  if  the  probability  of  success  in  each  trial  (i.  e.  reliability) 
equals  r.  You  can  see  that  this  part  of  the  equation  is  in  the  same  form 
as  the  corresponding  part  of  equation  5 and  equation  1,  which  are  used  for 
the  same  type  of  calculation.  The  last  part  of  equation  6 calculates  con- 
fidence that  reliability  is  below  the  criterion,  "pc",  if  we  experience 
exactly  s successes  in  n trials.  You  can  see  that  this  part  of  the  equa- 
tion is  in  the  same  form  as  equation  4,  which  is  used  for  the  same  type 
of  calculation.  The  two  parts,  together,  equal  the  probability  of  achiev- 
ing s successes  in  n trials,  multiplied  by  the  confidence  this  result  pro- 
vides that  reliability  is  below  the  criterion,  pc. 

The  first  summation  sign  in  equation  6 shows  that  the  calculation  just 
described  is  summed  from  s = 0 to  the  highest  value  of  s which  satisfies 
the  relationship  that  s is  equal  to  or  less  than  the  number  of  trials 
multiplied  by  the  satisfactory  level  of  reliability.  In  other  words,  all 
values  of  s are  summed  for  which  the  point  estimate  of  reliability  is  be- 
low the  criterion.  Thus,  all  values  of  s are  summed  which  contribute  con- 
fidence that  reliability  is  unsatisfactory. 

Figure  13  is  similar  to  some  of  the  previous  figures  except  that  this 
one  shows  the  probability  of  demonstrating  that  reliability  is  less  than 
the  criterion  of  .95*  As  you  can  see,  the  graphs  on  this  Figure  also  have 
sawtoothed  shapes.  They  also  peak  out  at  sample  sizes  which  are  multiples 
of  20  and  then  drop  when  the  sample  size  is  increased  by  one.  On  these 
graphs,  also,  the  prevailing  trend  is  for  the  probability  of  a successful 
reliability  test  to  increase  as  sample  size  increases.  Again,  though, 
there  are  some  exceptions. 

Figures  14  through  20  are  similar  to  this  Figure  except  that  they 
apply  to  different  levels  of  satisfactory  reliability. 

They  cover  satisfactory  reliability  levels  of  .90,  .85,  .80,  .75,  *70, 
.65,  and  .60. 

5.  CONCLUSIONS. 
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FIGURE  12 


EQUATIONS 
1.  P (i,  n, 


3.  C (r>pc) 


A.  C ( r A.  pc) 
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a.  It  is  practical  to  plot  the  probabilities  of  achieving  a variety 
of  reliability  testing  goals  as  a direct  function  of  sample  size. 

b.  Such  plots  are  of  value  to  the  test  designer  because  they  provide 
him  a direct  view  of  the  trade  off  between  his  goals  and  their  cost  in 
additional  samples. 
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DOVAP  BEST  ESTIMATE  OF  TRAJECTORY 


Robert  H.  Turner  and  William  S.  Agee 
US  Army  White  Sands  Missile  Range,  NM  88002 


ABSTRACT.  The  DOVAP  Best  Estimate  of  Trajectory  (BET)  is  a new 
develop  men  tTn"  trajectory  data  reduction  at  WSMR,  The  DOVAP  BET  provides 
the  capability  to  produce  high  quality  trajectory  solutions  one  to  three 
days  after  a flight  test.  It  is  batch  processing  type  of  trajectory  solu- 
tion - it  estimates  all  trajectory  points  simultaneously.  This  type  of 
solution  inherently  results  in  a very  large  but  sparse  set  of  equations. 

In  the  DOVAP  case  this  set  of  equations  is  block  tridiagonal  and  is 
solved  by  an  DU  matrix  decomposition.  The  solution  for  the  trajectory 
from  the  LU  decomposition  must  be  iterated  since  the  original  DOVAP 
equations  ore  nonlinear.  Numerical  procedures,  program  structure,  and 
data  editing  procedures  used  in  the  DOVAP  BET  program  are  described. 

1.  INTRODUCTION.  The  DOVAP  Best  Estimate  of  Trajectory  (BET) 

program  provides  "tlie ability  to  produce  high  quality  trajectory  solutions 

one  to  three  days  after  a mission.  More  specifically,  a DOVAP  BET  can  be 
provided  in  2-3  work  days  with  normal  availability  of  digitized  DOVAP 
tapes.  Upon  special  request  and  arrangements  by  the  Range  User,  a DOVAP 
BET  can  be  provided  in  one  work  day.  Real-time  digitizing  and  central 
recording  of  DOVAP  data  would  make  1-2  day  DOVAP  trajectory  solutions  a 
standard  item. 

The  DOVAP  BET  is  a batch  processor  - it  estimates  the  entire 
trajectory  at  once.  The  editing  of  observations,  which  takes  care  of 
both  wild  data  and  bias,  is  entirely  automatic.  The  DOVAP  batch  pro- 
cessor,since  it  solves  a set  of  nonlinear  equations,  requires  a start- 
ing guess  for  the  trajectory.  The  program  will  accept  trajectory  guesses 
from  several  sources.  These  guesses  may  often  be  several  miles  in  error 
without  affecting  the  convergence  of  the  batch  processor.  The  guessed 
trajectory  must  be  reasonably  smooth,  i.e.,  it  must  be  free  from  spikes. 
Sources  of  guessed  trajectories  presently  used  in  the  DOVAP  BET  program 
are  radar,  a nominal  trajectory  or  flight  path,  or  a sequence  of  con- 
stant points.  Since  the  input  to  the  DOVAP  BET  program  is  a merged  data 
tape,  the  most  common  source  of  a guess  trajectory  is  radar  which  is 
inputted  from  the  same  merged  tape.  The  use  of  radar  guesses  does  not 
slow  the  production  of  a DOVAP  BET  if  the  central  record  facility  for 
radar  is  used.  The  radar  data  need  not  be  calibrated  for  use  as  a tra- 
jectory guess.  The  DOVAP  BET  program  automatically  despikes  the  radar 
observations  before  use  in  constructing  a trajectory  guess. 

The  DOVAP  measurements  are  refraction  corrected  by  the  method  des- 
cribed in  Appendix  A.  Since  the  DOVAP  tracking  ranges  are  relatively 
short  and  the  DOVAP  BET  program  automatically  rejects  DOVAP  observations 
when  the  tracking  elevation  is  below  a preset  minimum,  an  approximation 
to  flat  Earth  ray  tracing  equations  has  been  found  to  provide  a sufficiently 
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accurate  elevation  refraction  correction  for  use  in  the  method  of 
Appendix  A. 

2.  DOVAP  MEASUREMENT  MODEL.  The  DOVAP  measurement  system  is  a two 
way  C V.'  doppler  system  which  measures  a loop  range  change  or  equivalently 
an  average  loop  range  rate  of  a target  between  consecutive  sampling  times 
t-[  and  tj  + j.  'Die  normal  sampling  interval  is  50  msec.  A DOVAP  instrumen- 
tation path  consists  of  a transmitter  station  and  usually  ten  or  more  re- 
ceiver stations.  In  addition  the  target  being  tracked  carries  a frequency 
doubling  transponder.  The  DOVAP  receiver  station  receives  both  the  ref- 
erence frequency  from  the  transmitter  and  the  frequency  doubled  signal 
from  the  target  transponder.  A frequency  comparison  of  the  two  signals 
and  the  addition  of  a 5KHZ  bias  signal  is  performed  at  the  receiver.  The 
resulting  signal  is  converted  to  digital  form  by  counting  the  doppler  cycles 
over  a 50  msec  time  interval.  The  resulting  biased  doppler  measurement 
is 


(1) 


M(t.+1)  = 14.72194  s(t.,t.+1) 


+ 50278  + N.  . 

l+l 


where  s(tz>t^+j)  is  the  refraction  distorted  change  in  loop  range  over  the 
50  msec  interval.  The  loop  range  is  defined  as  the  sum  of  the  range  from 
transmitter  to  target  and  the  range  from  target  to  receiver.  is  a 

measurement  error  which  is  assumed  to  be  random  with  zero  mean.  Thus, 
except  for  refraction,  for  which  a correction  is  applied,  DOVAP  measure- 
ments are  considered  to  be  unbiased.  Subtracting  the  bias  50278  and 
dividing  by  14.72194  yields  the  modified  DOVAP  measurement  M'(tj+j) 


(2) 


"*<W  ■ 


s(t.  t.  .) 

l,  l+l 


N.  . 

l + l 


The  correction  to  this  measurement  for  refraction  is  discussed  in  Appendix 
A.  After  correcting  for  refraction  we  have  the  measurement  model  for  the 
ath  receiver. 


(3) 


z (t.,t.  .)  = g (x.,x.  .)  + v (i+1) 
a l*  l+l  a l*  i+l'  a 


where  ga(xj,xi+i)  is  the  loop  range  change  from  tj  to  t^+j  and  the  arguments 
Xi  and  x^+i  indicate  the  dependence  on  the  position  coordinates  at  these  times. 
Xi  is  the  position  vector  of  the  target  in  the  launcher  coordinate  system. 

The  loop  range  change  is  modelled  as  follows.  Let  (xp,yp,Zp)  be  the  co- 
ordinates of  the  electrical  center  of  the  ground  transmitter  in  the  launcher 
coordinate  system.  Also,  let  (xR,yR,zR)  be  the  coordinates  at  the  electrical 
center  of  the  ground  receiver  antenna.  The  z coordinates  of  the  surveyed  trans- 
mitter and  receiver  sites  are  modified  to  obtain  Zp  and  zR.  The  amount  of 
modification  depends  on  the  type  of  antenna  used.  Let  (x,y,z)  be  the  launcher 
coordinates  of  the  target  transmitting  and  receiving  antennas.  Let  Rp  be  the 
range  from  ground  transmitter  antenna  to  target  receiving  antenna  and  Rr  be 
the  range  from  target  transmitting  antenna  to  ground  receiver  antenna.  Rp 
and  Rr  are  given  by 
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(4) 


RT(t±)  = L(x(ti)-xT)2+(y(ti)-yT)2+(z(ti)-zT)2]  2 

(5)  RR(t.)  = [(x(ti)-xR)2+(y(ti)-yR)2+(z(t.)-zR)2]15 
Then  the  loop  range  change  g(x^,x^+^)  is 

(6)  g(xi>xi+i)  = + _ 

The  partial  derivatives  required  for  processing  DOVAP  observations 
using  the  above  measurement  model  are 
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3.  THE  DOVAP  BATCH  PROCESSOR.  Given  a set  of  N time  points  ti,i=l,N 
along  a trajectory  and  a set  of  N-l  loop  range  change  observations  Za(i), 
i=2,N  for  M DOVAP  receivers  a=l,M,  we  want  to  estimate  the  position  vector 
Xi  for  each  of  the  trajectory  time  points,  t^.  Recall  the  DOVAP  observa- 
tion model  given  by  (2-3) 


(9) 


Z (t.  ,,t.)  = g (x.  ,,x.)  + v (i) 
l-l  i &ct  l-l  l a 


The  measurement  error  v (i)  is  assumed  to  have  zero  mean  and  variance  R (i) . 

a o*  ' 


(10) 


The  DOVAP  batch  processor  minimizes  the  weighted  sum  of  squares 

N M 

Q(x1,x2,  ...,xn)  = I I (za(ti_i>ti)-g0,(xi_i>xi)) 

r-2  a=l  aK  J 


Taking  derivatives  of  Q with  respect  to  x^,X2,  . . . ,x^  and  equating  to  zero 
results  in  the  equations 


(11) 


(12) 


where 

04) 

(15) 


^ R (2)  GaltxP  (Za(tl5  V_gcJXl,X2'>:)_0 
a=l  J 
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^=1  Ra(i+1)  al  j a j i+1  a j j+i  Ra0)  r 
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In  order  to  solve  the  above  system  of  3N  nonlinear  equations,  it  is  neces- 
sary to  linearize  these  equations  about  a set  of  points  xi° > x2° > • • • > XN° • 
The  set  of  points  may  be  obtained  from  a number  of  sources  and  need  not 
be  very  close  to  the  solution.  The  linearization  results  in  a set  of 
linear  equations  to  be  solved  for  (Sx^x^-Xj0 . The  nonlinear  equations  are 
then  relinearized  about  the  new  trial  solution  xjl=x^0+6x^.  This  process 
is  repeated  until  the  solution  converges,  i.e. , |6xjJ-*0.  The  following 
development  of  the  solution  of  the  linearized  equations  indicates  only 
the  first  iteration  in  the  process  of  solving  the  set  of  nonlinear 
equations.  Each  succeeding  iteration  in  the  process  is  identical  to 
the  first  one  with  Xj°  replaced  by  x^o+6x^.  Linearization  of  (11-13) 
results  in 


(16) 

Al6xl  + A126x2  = 

(17) 

A.\  .<5x,  .+A.6X.+A.  . .<$x.  . » y(j),  j = 2,  N-l 

(18) 

an-i,n  6xn-i  + an6xn  = y(N') 

where 

(19) 

^1  " £ . R (2)  Gal  (xl  Gal(xl  ^ 
a=l  a 

(20) 

Aj  • l l fun  GIl(xj°)(:al(xj#)*  R ... 

J a=l  a(j  + l)  J J a Cj ) 

(21) 

AN  " i . R (N)  Ga2(xN  ^ Ga2 (xN  ^ 
a=l  av 

(22) 

M , T 

AK,K+1  %sl  F(K+1)  Gol  (XK  •*  Ga2  (xK+l  * 

(23) 

1 , hrr  Gh  (xi0)  V2) 

a=l  J 

(24) 

y0)  = L i.o«u  fb0)  r.0*1}  * 1.0) 

(25) 

yw-lt  rw  GL  (xn0) 

a=l  a 
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r (j)  is  the  residual 


(26)  r (j)  = Z (t.  rtj)-g  (x..!0,^0) 

The  system  of  linear  equations  (16-18)  can  be  written  in  the  block  tri- 
diagonal matrix  fora 


(27) 


0 0 
0 0 


o o <T 

6xj 

~y(D 

A23  0 0 

6x2 

y(2) 

A3  A34  0 0 

• 

6x3 

• 

y(3) 

♦ 

aJ  a a 

V2,N-1  N-l  N-1,N 

4 

* 

<XN-1 

• 

• 

y(N-l) 

0 an-i,n  an_ 

i 

1 

<-o 

1 

_y(N) 

This  set  of  equations  to  be  solved  for  the  6x's  is  of  very  large  dimension 
since  N,  the  number  of  trajectory  points  to  be  estimated,  is  typically  in 
the  range  50-300.  In  addition  these  equations  must  be  solved  many  times 
in  the  iteration  process  to  solve  the  nonlinear  set  of  equations  in  (11-13). 
Fortunately,  there  is  a very  simple  form  solving  the  block  tridiagonal 
system  equations  of  (27). 

Let  A be  the  above  block  tridiagonal  matrix.  Since  A is  a least  squares 
matrix,  it  is  symmetric  and  positive  definite.  A standard  theorem  in  matrix 
theory  is  that  for  any  symmetric  positive  definite  A there  exists  a non- 
singular, lower  triangular  matrix  L such  that 

(28)  A = LLT 

In  the  case  of  our  block  tridiagonal  matrix  A the  matrix  L has  an  especially 
simple  form.  In  this  case  we  can  write  the  decomposition  (2g)  as 
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(29) 


T T - - . 

L1 

lJ  l‘x  0 0 

T T 

L21  L2 

i i n 

l2  32  

0 Lv„  L, 

T 

l„  : 

32  3 

0 

• • 

T 

I N- 1 

• • 

0 0 LV  XT  1 Lx, 

N,N-1  N 



All  block  elements  of  the  above  block  triangular  natrices  are  3x5.  ^Equat- 
ing block  elements  of  A with  the  corresponding  block  elements  of  LL1 , we 
find  the  following  set  of  equations. 


(30) 

Ai  - LiLI 

(31) 

T 

A12  = L1L21 

(32) 

T T 

A.  = L.  . .L!  . . + L.  L. 
l x,i-l  i,i-l  i i 

i=2,N 

(33) 

T 

A.  . . = L.  L.  . 

1,1+1  l l+l,i 

W’ 

II 

ro 

Since  the  matrices  are  lower  triangular  a Choleskv  deconposi tion  algorith 
is  used  to  solve  (30)  and  (32)  for  L^.  Also,  since  the  Li  are  lower  tri- 
angular, (31)  and  (33)  defining  the  off  diagonal  blocks  Li  i.j  are  solved  as 
triangular  systems  of  equations  for  the  columns  of  L{ 

The  square  root  decomposition  of  A allows  us  to  replace  the  set  of 
equations  A6x=y  given  by  (27)  by  two  sets  of  equations 

(34)  L y'o  y 

(35)  LT6x  = y* 


Hie  lower  triangular  system  of  equations  in  (34)  may  be  written  in  te 
the  3x3  set  of  block  elements  as 


(36) 

(37) 


LjXj  = y (i) 

Wi.l  = y 


rr.s  of 


593 


This  set  of  3x3  lower  triangular  equations  is  solved  sequentially  for  the 
vectors  yj . Similarly,  the  upper  triangular  system  of  equations  in  (35) 
can  be  written  in  terms  of  the  3x3  set  of  block  elements  as 


(38) 

(39) 


L;J  6xn 


yN 


T ' T 

L.  . fix.  = y.  .-L.  . , fix. 
l-l  l-l  'i-l  i,i-l  l 


This  set  of  equations  is  then  solved  sequentially  for  the  vectors  fix^ 

From  the  theory  of  weighted  least  sequares  we  know  that  the  covariance 
of  the  vector  estimate  of  x obtained  on  the  final  iteration  is  approximately 
A"*  obtained  on  the  final  iteration.  Due  to  the  method  of  solution  of  the 
equations,  A_1  is  not  available,  llov.'ever,  it  is  relatively  easy  to  obtain 
the  covariance  of  the  individual  vector  estimates  x^  from  the  block  elements 
of  the  square  root  matrix  L.  A“1  is  given  by 

(40)  A-1=L”TL_1 

It  is  easily  determined  that  the  block  elements  of  are  given  by 


(41) 

l:1  . l:1 

11  1 

i = l ,N 

(42) 

l:1.  , -l:1 

13 

L-  • 1 l ’! 

3,3-1  3-1 

j = 2, i 

The  covariance  matrices  of  the  estimates  x.  are 

i 


(43) 


cov 


(x.)  = a?1  = y lt!  lt1 

i ii  jai  ij  ji 


N 


-T  -1 


A computing  formula  for  these  3x3  covariance  matrices  is  obtained  by 
substituting  the  recursive  relations  for  Lrj  given  by  (42)  into  (43).  Thus 

(44)  a:1  -f  l:t,  . l:1.  , 

i-1*1-1  j=i.x  3,1-1 

MO  A — 1 r . -T  , -1  ,-T  -1 

(45)  A....—  ) L.  , . L.  . , + L.  . L.  . 

l-l, l-l  H . 1-1,1  3,1-1  l-l  l-l 


Using  (42) 
(46) 


-1 

A.  . . . 
l-l  ,1-1 


N 


l 

j=i 


L. 

l 


T 


) 


_T  -1  -1  -T 

h.  . L..  (L.  . L.  , )+  L.  . 
13  ji  ••  i,i-l  l-l  l-l 


L 


-1 

i-1 
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(47) 


Obvious ly, 
(48) 


4,  FRONT  END  EDIT.  Before  using  the  DOVAP  measurements  in  the 
batch  processor,  wild  observations  are  detected  and  deleted  arid  the 
variances  Ra(j),  which  are  used  in  forming  weights  for  the  batch  pro- 
cessor, are  computed, 

til 

For  the  a receiver  consider  the  set  of  observation  {Za (t j ,t j +i) } 
on  the  interval  C(ti,ti+Np)3 . If  no  observations  have  been  drop  locked 
for  this  receiver,  there  will  be  Np  equally  spaced  observations  in  this 
interval.  Let  Na  be  the  actual  number  of  observations  in  the  interval. 

A quadratic  curve  is  fitted  to  this  set  of  Na  observations.  Let 
2a(tj_i,tj)  denote  the  value  of  the  curve  at  tj  and  let  Za(tj_i,tj)  = 
Za(tj_i,tj)-Za(tj_i,tj)  be  the  residual  from  the  curve  fit.  The  set 
of  residuals  on  the  interval  [ (ti»ti+Np) ] are  tested  for  spurious  ob- 
servations using  the  sample  skewness  coefficient  and  the  sample 
Kurtosis  coefficient  b2.  These  statistics  are  given  by 

C I (Z  (t.  .,t.)-u  )3 
a h v a l-l*  i or 

(49)  ^7  = 

(N  -1)  s 3/2 

N I (Z  (t.  ,,t.)-y  )4 
a f v av  i-l1  x a 
i 

(N  -1)  s 4 
v a a 

2 

where  iia  and  sQ  are  the  sample  mean  and  variance,  respectively.  If 
either  or  b2  exceed  values  for  the  5%  significance  level  of  these 

statistics,  the  observation  farthest  from  the  mean  is  deleted.  A curve 
is  fitted  to  the  remaining  observations  in  the  interval  and  the  test  for 
spurious  observations  is  again  applied.  This  sequence  of  fitting  and 
testing  is  repeated  until  |f17jjjand  b2  are  less  than  the  values  correspond- 
ing to  the  5%  significance  levels  or  until  less  than  . 75Np  observations 
remain  in  the  interval.  When  no  more  observations  are  to  be  deleted, 
the  points  which  were  deleted  are  replaced  by  the  corresponding  values 
of  the  final  curve  fit.  In  addition  all  points  which  are  originally 
missing  from  the  interval,  i.e.,  the  drop  locked  points,  are  replaced 
by  their  corresponding  values  from  the  curve  fit.  If  more  than  . 2SNt> 
of  the  observations  in  the  interval  require  replacement,  all  observations 
in  the  interval  are  deleted  and  not  replaced. 


(50)  b2  = 


-1  ,-T 

i-l, i-l  “ i-l 

uT . , a:! 
1,1-1  11 

-1  = l"t  l’1 

‘NN  N N 

-1 

1-1 
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The  interval  (ti,ti,Np)  is  chosen  so  that  it  has  an  integral  number 
of  subintervals  each  having  Lp  observations.  The  observations  in  each  cf 
these  subintervals  are  summed  to  provide  loop  range  change  measurements 
to  the  batch  processor  which  are  .OSLp  sec  apart.  Tie  variance  of  these 
observations,  which  are, used  in  the  batch  processor,  are  comutc-d  as 
Ra(i)=LpSa4 (i)  where  s«^  is  the  sample  variance  computed  from  the  curve 
fit  on  the  interval  (ti,ti+ftp).  These  will  be  Np/Lp  consecutive  variances 
Ra(i)  which  will  be  equal  for  the  a receiver.  "The  above  editing  pro- 
cedure is  applied  to  each  receiver  and  to  a set  of  intervals  which  cover 
the  entire  trajectory. 

Replacement  of  wild  observations  is  not  in  itself  a desirable  pro- 
cedure and  is  certainly  not  necessary.  However,  in  the  DOVAP  batch 
processor  this  replacement  procedure  greatly  simplifies  the  operational 
logic  of  the  computer  program.  By  being  conservative  in  the  replacement 
of  observations  and  realizing  the  replacement  can  be  dangerous,  no 
apparent  difficulties  have  been  encountered. 
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5.  POINT  EDIT  AND  RliCEIVf-R  EDIT.  Although  the  front  end  edit  has 
deleted  most  the  spurious  observations,  some  may  still  remain.  More 
importantly,  a receivers  observations  may  be  biased  either  over  a small 
portion  of  the  trajectory  or  over  the  entire  trajectory.  The  point  edit 
and  receiver  edit  are  designed  to  delete  observations  in  these  categories. 

Following  the  front  end  edit  and  iteration  of  the  batch  processor 
to  convergence,  the  point  edit  procedure  is  applied  to  each  receiver 
observation  used  in  the  batch  processor.  At  each  trajectory  time,  ti, 
the  normalized  residuals 


r (i) 

(51)  r*  (i)  = -2L 

a A HIT 

a 

are  examined  for  each  receiver  a used  in  the  batch  processing.  The  residual 
ra(i)  is  defined  by  (26)  and  is  the  residual  computed  in  the  last  iteration 
of  the  batch  processor.  If  jra(i)|>5,  corresponding  observation  is  deleted 
permanently  from  the  solution.  The- observation  corresponding  to  the  largest 
of  the  |r*(i)|  such  that  3<|r*(i)|<  5 is  deleted  temporarily.  After  exam- 
ining residuals  for  the  entire  trajectory  and  deleting  the  ones  dictated  by 
the  above  tests,  the  batch  processor  is  reiterated  to  convergence.  Add- 
itional observations  are  then  removed  according  to  the  above  criteria.  The 
iteration  and  point  edit  cycle  is  repeated  until  |r*(i)|<3  for  all  remaining 
residuals . 

When  the  above  iteration  and  point  edit  cycle  is  completed,  a receiver 
edit  identifies  receivers  suspected  of  being  biased.  The  receiver  bias  edit 
computes  the  statistic 


a 


for  each  receiver.  The  sum  in  the  above  equation  is  over  all  residuals  for 
the  a*-^1  receiver  which  were  not  deleted,  either  temporarily  or  permanently 
by  the  point  edit.  Ma  is  the  number  of  terns  in  this  sum.  If  rn  = M^xjma|>3, 
the  receiver  corresponding  to  the  value  m is  considered  to  be  biased  and  is 
deleted  from  the  batch  processing  solution.  The  iteration  and  point  edit 
cycle  is  then  repeated  without  this  receiver.  All  residuals  which  were 
temporarily  deleted  on  the  previous  point  edit  cycle  are  restored  to  the 
solution.  The  receiver  edit  is  repeated  following  an  iteration  and  point 
edit  cycle  until  no  additional  receivers  are  suspected  of  being  biased. 
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6,  TRAJECTORY  FILLING.  The  DOVAP  batch  processing  solution  produces 
a trajectory  position  solution  at  time  points  spaced  .OSLp  sec  apart.  The 
number  Lp  is  usually  chosen  so  that  these  time  points  are  1-2  sec  apart. 

It  is  obvious  that  a 1-2  sec  interval  between  trajectory  position  solutions 
is  unacceptable  for  most  purposes.  This  is  particularly  true  if  velocity 
and  acceleration  states  are  to  be  derived  from  the  position  solution.  Al- 
though we  could  set  Lp=l  so  that  a batch  processor  solution  is  computed  at 
every  DOVAP  sample  time,  this  is  undesirable  since  the  CPU  time  and  mass 
storage  used  by  the  DOVAP  EOT  program  is  greatly  increased,  A procedure 
described  below,  which  we  call  trajectoiy  filling,  is  used  to  add  position 
solutions  between  the  batch  processor  position  solutions.  Experimental 
results  show  that  there  is  very  little  difference  between  the  position 
solutions  obtained  by  trajectory  filling  and  those  obtained  by  setting 
Lp*l , 

Let  the  DOVAP  sample  times  between  batch  processor  times  t^  and 
be  denoted  by  ti, i,tif 2 , . . . >ti, Lp-1  and  the  corresponding  trajectoiy  position 
vectors  by  Xj[ ^x^ • • • *xi , Lp-1*  least  squares  position  solution  for 

computing  x;  -j+i,  given  • is 

$ J * f J 


s 2 (i) 

a 


(x.  . ) r„(t.  . ,t.  . 

i»r  0 1,1  i,j+i 


where 

(54) 


M=  y G 0(x.  . 

“ sa*(i)  a2v  1,J 


(x.  .)  G _(x.  .) 


The  sums  in  the  above  two  equations  are  over  all  receivers  whose  observations 
were  in  the  final  batch  processor  iteration  for  the  time  interval  tj[  to  tj>i. 
The  quantity  s£(i)  is  the  sample  variance  for  the  interval  t^  to  ti+i  which 
was  computed  in  the  front  end  edit. 
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APPENDIX  A 


DOVAP  REFRACTION  CORRECTION.  The  refraction  correction  applied  to 
DOVAP  is  considerably  more  conniex  than  that  normally  applied.  Since  the 
ranges  involved  in  DOVAP  tracking  are  relatively  short,  a flat  Earth  ray 
tracing  procedure  can  be  used.  Also,  since  DOVAP  data  is  normally  used 
only  when  the  elevation  angles  are  sufficiently  large,  say  greater  that 
10°;  an  approximation  to  the  ray  trace  procedure  is  sufficiently  accurate. 

In  the  figure  below  the  DOVAP  transmitter  is  located  at  T and  the 
receiver  at  R.  The  target  is  at  M travelling  with  velocity  V^. 


xT(rM)  and  tR(rM)  are  unit  vectors  tangent  at  the  target  to  the  ray  paths 
from  transmitter  to  target  and  target  to  receiver,  respectively. 

For  an  isotropic  medium  and  for  targets  with  speed  |Vfi|«  c,  the 
doppler  frequency,  fp,  at  the  receiver  can  be  expressed  in  terms  of  the 
phase  path  length  L, 


(55) 


dL 

3t 


where  fo  is  the  transmitted  frequency.  The  phase  path  length  for  the  above 
figure  is 


(56) 


M 

n(r)ds  + 
T 


fR 

n (r)ds 
M 


where  n (r)  is  the  index  of  refraction  at  a point  on  the  rav  path.  The  first 
term  in  (56)  is  the  phase  path  length  from  transmitter  to  target  and  the 
second  term  is  the  phase  path  length  from  target  to  receiver.  The  integration 
is  to  be  carried  out  along  the  ray  path. 

The  element  of  path  length  ds  may  be  written  as 

(57)  ds  = x(r)  • dr 

where  t (r)  is  a unit  tangent  vector  to  the  ray  path  at  point  r. 


The  path  length  can  then  be  written  as 


(58) 


L = 


rM(t) 

n(r)t(r)  • dr  + 


rR 

n(r)t(r)  • dr 

)_ 

^ (t  ) 

Differentiating  (58)  with  respect  to  t and  ignoring  ~ 

O t 

dr 


(59) 


3F- 


M-  - _ ^rM+ 

- n(rM+)T(rM+)  • ^ 


where  the  - and  + refer  to  the  transmitter  and  receiver  sides  of  the  target , 
respectively*  [59)  can  be  rewritten  as 

(6°)  dt  = n^rM^TT^P’TR^p)  * VM 

(60)  shows  that  the  received  doppler  signal  is  dependent  on  the  refraction 
effects  of  the  atmosphere  in  two  ways.  First,  note  that  the  received  cou- 
pler is  proportional  to  n(ry),  the  coefficient  of  refraction  at  the  target. 
Second,  the  received  doppler  is  not  quite  proportional  to  the  radial  velocity 
as  is  assumed  since  the  unit  vectors  t„  and  Tj  are  not  along  the  sight  line- 
vectors. 
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Let  tj  (rM)  and  xR  (rM)  he  unit  vectors  along  the  sight  lines 
from  transmitter  to  target  and  target  to  receiver,  respectively.  The 
geometric  situation  is  shown  below. 


The  unit  sight- line  vectors  can  be  written  as 


(61) 

(62) 


(flfV 


1 W 


where  rp  and  are  position  vectors  of  the  transmitter  and  receiver, 
respecti vely , Let  v^^  be  the  observed  doppler  velocity  given  by  (60) 
and  let  vcj  be  the  doppfer  velocity  if  there  were  no  refraction  effects, 
vj  is  given  by 


(63) 


vd  ■ 


R } 


'M 


We  can  write  v.  in  terms  of  v,  n 
d doBs 


as 
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(64) 


where 

(65) 


vdoBs 


"(rMJ 


4vd 


4vd  “ 


(v'41}) 


M 


■ t?R-'TRU) 


In  order  to  correct  for  refraction  in  DOVAP  or  equivalently  to  correct 
vdoBs  t0  vd  shows  that  we  first  divide  v<j0gs  by  the  coefficient  of 
refraction  at  the  target  and  then  substract  Av^, 

In  order  to  compute  • V»q  needed  in  the  computation  of 

Av^,  consider  one  leg  of  the  DOVAP  propagation  path  shown  below 


EQ  is  the  observed  elevation  angle  and  Ep  is  the  true  elevation.  The 
elevation  refraction  correction  is  AE*Ej-E0.  The  angle  <5y  is  given  by 


(66) 

9 

et’em 

(67) 

6\p  * 

WW 

(68) 

6\p  ■ 

OVrE0) 
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denoting  (Em-Eq)  by  AE^ 


(69) 


6<p  ~ AE-AE 


M 


Both  AC  and  AE^  are  available  from  a ray  trace. 


Let  s^>  be  an  orthonormal  set  of  base  vectors  with  s^  along  the 
sight  line  vector  from  transmitter  to  target  and  in  the  vertical 
plane  containing  In  terms  of  these  base  vectors  xj-^and  can 


be  written  as 
(70) 


T^1}=  [1  o 0] 


_T1 

*T2 

ST3 


(71) 


x.p  « [cos  0-sin6$„] 


T1 

[12 

;T3 


Since  6i{/^  is  a small  angle  (71)  can  be  approximated  as 


(72) 


xT  ^ [1  0-6^3 


T1 


ST2 

ST3 


The  velocity  vector  V,.  can  also  be  renresented  in  the  s„>  basis  as 

M - i 


(73) 
where 

(74) 


V a <S  V > 

VM  T M 


VM  > = M . 
M s^L 


X 

# 

y 

z 
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N's-j-L  t^e  rotati°n  matrix  from  the  launcher  coordinate  system  to  the 

sight  line  coordinate  system  defined  by  Sy>  . x,y,  and  i are  the  coordinates 

of  the  velocity  vector  in  the  launcher  coordinate  system.  The  matrix 

M may  be  computed  bv  rotating  thru  an  azimuth  angle 
s ^ L 


(75) 


aT  = tan 


_1  X_XT 


and  then  thru  an  elevation  angle 


(76) 


eT  = tan 


-1 


z-z. 


[(x-x^fy-y^V^ 


In  terms  of  these  angles  M .is 

St*-* 


(77) 


cosot,, 


cose^sina^, 

-sine^sina^. 


-sina„  0 


cose„cosa_  sine„ 

1 T j 

-sine,fcosaT  cose^ 


Using  (70) , (72) , (74) , and  (77)  the  contribution  of  the  transmitter  leg 
of  the  propagation  path  to  the  refraction  correction  Av^  is 

(78)  (x^-rl1^)  • VM  % 6V»t(-x  sineTsinc»T-y  sine,j.cosaT+z  coscT) 

A similar  equation  holds  for  the  receiver  leg  of  the  ray  path 

(7y)  -(Tr-Tj^  ) ' V £ -<$i/iR(-x  sineRsinctR-y  sineRcosaR+z  cosep) 

The  correction  Av^  given  by  (65 j can  now  be  computed. 
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OPTIMAL  DOVAP  INSTRUMENTATION  PLANNING 


William  S.  Agee  and  Jerry  L.  Meyer 
US  Army  White  Sands  Missile  Range,  NM  88002 


ABSTRACT . A DOVAP  instrumentation  planning  procedure  has  been 
developed  which  selects  a nearly  geometrically  optimal  set  of  M receiver 
sites  from  over  600  sites  presently  available  at  WSMR.  The  criterion 
used  for  selection  of  the  sites  is  the  minimize 

N 

E w.trVi 

i-1  1 

where  Vi  are  the  3x3  diagonal  blocks  of  the  3Nx3N  error  covariance  matrix 
V which  would  result  if  the  data  from  the  receivers  were  batch  processed 
to  obtain  vector  position  estimates  xp,  i=l,N  for  N points  entirely  cover- 
ing a nominal  trajectory.  The  wp>o  are  weights  used  to  denote  the  rela- 
tive importance  of  precise  estimates  at  each  of  the  N trajectory  points. 
The  final  output  of  the  selection  procedure  is  a list  of  the  M receiver 
sites,  the  trajectory  segments  in  which  each  of  these  receivers  should 
produce  usable  doppler  data,  the  geometrical  covariance  for  each  tra- 
jectory point,  and  a computer  plot  of  the  receiver  sites  along  with  the 
ground  track  of  the  nominal  flight  path. 

1.  INTRODUCTION.  The  DOVAP  instrumentation  planning  procedure 
described  in  this  report  is  designed  to  assist  in  the  development  of  a 
DOVAP  Instrumentation  Plan  (IP)  for  a given  nominal  trajectory  or  flight 
path.  The  technique  produces  a locally  optimal  selection  of  M existing 
receiver  sites  based  on  geometrical  considerations.  Since  other  signi- 
ficant factors  such  as  multipath,  power  requirements,  site  accessability , 
etc.  are  also  important  in  the  preparation  of  a DOVAP  instrumentation 
plan,  consideration  of  the  factors  may  require  alteration  of  the  geomet- 
rically optimal  instrumentation  plan.  For  this  reason  the  computer  pro- 
gram also  has  a compare  mode  to  determine  the  geometrical  degradation 
between  the  optimal  plan  and  the  altered  plan. 

Given  a nominal  flight  path  specified  by  the  position  vectors  xp , 
i=l,N  along  the  path,  the  receiver  selection  process  minimizes 

N 

E w. tr  cov (x. ) 
i-1  1 

where  cov(x|)  is  the  3x3  covariance  for  an  estimate  of  x^,  which  would 
result  from  estimating  the  trajectory  with  the  DOVAP  BET  program  [ 1 ] . 

The  wj  are  positive  weights  used  to  emphasize  some  trajectory  points 
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more  than  others.  The  most  common  choice  for  wi  is  w^=l  for  all  i,  i.e. 
each  trajectory  point  has  equal  weight  in  the  receiver  selection  process. 
cov(xj)  used  in  the  criterion  function  is  a purely  geometric  covariance, 
i.e.,  all  receiver  noise  covariances  are  unity. 

Selection  of  the  best  possible  set  of  M DOVAP  receivers  from  the  650 
receiver  sites  presently  available  at  WSMR  is  a very  large  combinatorial 
programming  problem.  Enumeration  and  examination  of  all  possible  sets  of 
M receivers  for  a given  flight  path  is  obviously  an  impossible  computing 
problem.  Several  methods  for  the  solution  of  integer  and  combinatorial 
programming  problems  are  available  in  the  literature  on  Operations 
Research.  However,  the  author  has  not  been  able  to  formulate  the  present 
problem  in  a way  which  would  be  amenable  to  solution  by  these  methods. 
Rather  than  pursuing  the  globally  optimal  set  of  M DOVAP  receivers,  we 
will  find  a locally  optimal  set  of  receivers  based  on  a plan  improvement 
algorithm.  Starting  with  some  arbitrary  initial  IP  having  M receivers, 
we  make  successive  interchanges  between  receivers  presently  in  the  IP  and 
those  outside  the  plan  until  no  further  improvements  are  possible.  At 
each  stage  of  the  improvement  algorithm  the  interchange  is  made  which 
results  in  the  greatest  improvement  to  the  IP  according  to  the  minimiza- 
tion criterion.  The  resulting  plan  is  only  locally  optimal  in  the  sense 
that  it  is  dependent  on  the  initial  plan  with  which  the  algorithm  start- 
ed. Starting  with  a different  initial  IP  we  might  achieve  a different 
final  IP  having  a larger  or  smaller  value  of  the  criterion  function. 

Having  started  with  different  initial  IP's  in  a couple  of  cases,  final 
IP's  were  reached  which,  although  they  were  slightly  different  in  composi- 
tion, have  nearly  the  same  values  for  the  criterion  function. 

The  algorithm  is  relatively  economical  in  terms  of  CPU  time,  requir- 
ing 2-3  minutes  for  a plan  with  eighteen  receivers.  The  major  factor  in 
determining  the  computer  time  used  in  selecting  the  plan  is  not  the 
number  of  receivers  in  the  plan  but  the  number  of  receivers  which  must  be 
considered  as  possibly  being  added  to  the  plan  for  improvement.  The  out- 
put of  the  program  implementing  the  algorithm  is  a set  of  M receiver 
sites,  the  value  of  the  criterion  function,  the  differential  contribution 
to  the  criterion  function  for  each  receiver  at  each  trajectory  point,  the 
trajectory  intervals  during  which  each  receiver  should  produce  usable 
data,  and  a computer  plot  of  the  receiver  and  transmitter  sites  along  with 
the  ground  track  of  the  nominal  flight  path. 

2.  DOVAP  MEASUREMENT  MODEL.  The  DOVAP  measurement  system  is  a two 
way  CW  doppler  system  which  measures  a loop  range  change  or  equivalently 
an  average  loop  range  rate  of  a target  between  consecutive  sampling  times 
ti  and  tf+i.  The  normal  sampling  interval  is  50  msec.  A DOVAP  instru- 
mentation path  consists  of  a transmitter  station  and  usually  ten  or  more 
receiver  stations.  In  addition  the  target  being  tracked  carries  a fre- 
quency doubling  transponder.  The  DOVAP  receiver  station  receives  both 
the  reference  frequency  from  the  transmitter  and  the  frequency  doubled 
signal  from  the  target  transponder.  A frequency  comparison  of  the  two 
signals  and  the  addition  of  a 5KHZ  bias  signal  is  performed  at  the 
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receiver.  The  resulting  signal  is  converted  to  digital  form  by  counting 
the  doppler  cycles  over  a 50  msec  time  interval.  The  resulting  biased 
doppler  measurement  is 

CD  m(t.+1)  = 14'72194  sCWp)  + 50278  + Ni+i 

where  s(tp,  tp+p)  is  the  refraction  distorted  change  in  loop  range  over 
the  50  msec  interval.  The  loop  range  is  defined  as  the  sum  of  the  range 
from  transmitter  to  target  and  the  range  from  target  to  receiver.  Np+i 
is  a measurement  error  which  is  assumed  to  be  random  with  zero  mean. 

Thus,  except  for  refraction,  for  which  a correction  is  applied,  DOVAP 
measurements  are  considered  to  be  unbiased.  Subtracting  the  bias  50278 
and  dividing  by  14.72194  yields  the  modified  DOVAP  measurement  m'(ti+i) 

(2)  ■ m'(ti+1)  = sCt.,tltl)  ♦ NU1 

After  correcting  for  refraction  we  have  the  measurement  model  for  the  ath 
receiver. 

(3)  z (t.,t.  .1  = g (x. ,x.  .)  + v (i+1) 

v J or  i*  l+l-  bcr  i’  l+r  or  1 

where  g(xi,xi+i)  is  the  loop  range  change  from  ti  to  tp+i  and  the  argu- 
ments xi  and  xp+p  indicate  the  dependence  on  the  position  coordinates  at 
these  times,  xp  is  the  position  vector  of  the  target  in  the  launcher 
coordinate  system. 

The  loop  range  change  is  modelled  as  follows.  Let  (xT,yp,zp)  be  the 
coordinates  of  the  electrical  center  of  the  ground  transmitter  antenna  in 
the  launcher  coordinate  system.  Also,  let  (xR,yR,ZR)  be  the  coordinates 
at  the  electrical  center  of  the  ground  receiver  antenna.  The  z coordinates 
of  the  surveyed  transmitter  and  receiver  sites  are  modified  to  obtain  zj 
and  zr.  The  amount  of  modification  depends  on  the  type  of  antenna  used, 
let  (x,y,z)  be  the  launcher  coordinates  of  the  target  transmitting  and 
receiving  antennas.  Let  ,Rp  be  the  range  from  ground  transmitter  antenna 
to  target  receiving  antenna  and  Rr  be  the  range  from  target  transmitting 
antenna  to  ground  receiver  antenna.  Rr  and  Rp  are  given  by 

(4)  ^(t.)  = [Cx(ti)-xr)2+(y(t.)-yT)2+Cz(t.D-zT)2]^ 

(5)  RR(t.)  = [(x(ti)-xR)2+(y(t.)-yR)2+Cz(t.)-zR)2]32 
Then  the  loop  range  change  g(xp,x^+^)  is 

(6)  sCXi^)  = RpCtp+p)  + RRCti+i)  " RT^i^  " RR^i^ 
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The  partial  derivatives  required  for  processing  DOVAP  observations 
using  the  above  measurement  model  are 


(7) 
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4.  RECEIVER  EFFECTIVENESS.  The  covariance  matrix  A--*-  of  the  DOVAP  batch 
processor  solution  J_2 J may  be  written  in  the  elementary  form 


(9) 
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J (x.  ,x. ) 

a i-I  i 
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a i-l’  iJ 
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where  J fx.  ..x.)  is  the  3N  vector 
a i-l  iJ 
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i£  the  ath  receiver  is  used  in  the  ith  interval  and  Ja(xi-1 Jxi)=0  if  the 

T 

ath  receiver  is  not  used  in  the  ith  interval.  Gaj(xi_i)  is  the  partial 
derivative  of  the  DOVAP  measurement  function  ga(xi-l>xi)  with  respect  to 

T 

its  first  argument  and  Ga2(xi)  is  the  partial  derivative  with  respect  to 
the  second  argument.  A receiver  may  be  used  in  the  ith  interval  only  if 
its  tracking  elevation  6a(i)  at  the  midpoint  of  the  interval  satisfies 
the  condition  8i_<  0a(i)  < 02-  01  and  02  are  input  to  the  program  with  9l 

normally  in  the  range  10^-20°  and  02  normally  in  the  range  70° -80°.  The 
effect  of  adding  or  deleting  a DOVAP  receiver  from  the  kth  interval  is 
easily  written  using  the  vectors  Ja(xi_i  ,xi) . Let  A+lbe  the  covariance 
matrix  after  adding  a receiver  8 to  the  kth  and  let  A^l  be  the  covariance 
after  deleting  a receiver  from  the  kth  interval.  A+land  A-l  are  given  by 


(11) 


(12) 


A ljp^xk-l*V  ’V3Ck-l’xk-)  A 


1 . A- 


Je(xk-i’V 


A J3^Xk-l,Xk^  Jg^xk-l,Xk^  A 

1 " tVXk-l,'V  A J8(xk-l,Xk') 


Thus  the  effect  of  adding  or  deleting  a receiver  8 in  the  k 


th 


interval  is 
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(13) 


E(M)  - ; 

x — Q 


where  the  V(k,3)  is  a 3N  vector.  The  upper  sign  indicates  addition  of  a 
receiver  and  the  lower  sign  deletion  of  a receiver. 


(14) 

and  q is  the  scalar 


V(k,0)  = A-1Je(xk.1,xk) 


(i5)  , - 


Partition  V(k,B)  into  its  3x1  subvectors 
(16) 

VjCM) 


V(k,B) 


V9  (k,B) 


Vk’3) 


The  3-vectors  Vj(k,3)  are  easily  computed  as 


(17) 


Vj(k,g)  = A j ,k-lG$l^xk-P  + A j,kGB2(xk:) 


where  A.!  are  3x3  block  elements  of  A The  scalar  q is  given  by 

lj  Mo. 


(18) 


q = Gg1(^k_1)  Vk_1(k,$)  + Cg2(xk)  Vk(k,6) 


The  effect  matrix  E(k,3)  can  be  written  in  terms  3x3  block  elements 
Eij(kf0) 


(19) 

Define  the  vector 


E..(k,B)  = + 


v.(k,3)  v:  (k,g) 

— - 
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(20) 


V. 


V.(k,3)  - 


C,B) 

±~<T 


Then 

(21)  E. .(k,B)  = * V . (k,B)  V*T(k,8) 

This  expression  for  the  effect  of  adding  or  deleting  the  3t^1  receiver  in 
the  kth  interval  is  used  to  modify  the  covariance  matrix  A-l.  The  above 
equations  are  used  sequentially  to  add  or  delete  the  3^  receiver  for  al 
intervals  k=2,N.  In  adding  or  deleting  in  the  (k+])st;  interval,  A is 
the  modified  covariance  obtained  in  adding  or  deleting  from  the  kth 
interval. 


The  effectiveness  of  a receiver  is  defined  as 


(22) 

Using  (13), 
(23) 


e($)  = 

(16) , and 
e(B)  = 


N 

E TrE(k,$) 
k=2 

(20)  in  (22)  e(B)  can  be  written  as 


E E Tr  V.  (k,B)  V.‘(k,B) 
k=2  i=l 


or 

(24) 


e(3) 


IN  .M  * 

E E V. 1 (k,3)  V (k,B) 
k=2  i=l  1 1 


More  generally,  the  receiver  effectiveness  may  be  defined  giving  more 
importance  to  some  trajectory  points  than  others  by  using  weights  >'.^>0. 
Then  we  define  the  receiver  effectiveness  as 


(25)  e(3)  = E E W V*T(k,B)  V*(k,B) 

k=2  i=l  1 1 

Another  quantity  which  is  useful  in  analyzing  the  effects  of  a receiver 
in  a DOVAP  IP  is  the  differential  effect  of  a receiver  which  is  defined 
as  the  increase  in  the  diagonal  elements  of  A~1  if  the  receiver  is  delet 
ed  from  the  IP.  Thus  the  differential  effect  is  the  diagonal  elements  o 

N 

(26)  E(3)  = l E (k,3) 

k“2 
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Also  of  interest  are  the  quantities 

N *2 

(26)  e,(k,3)  = E V ;(k,B)  j=l,2,3 

J i=l  J 

the  numbers  e-j(k,3)  for  a given  interval  k show  the  importance  of  the 
observations  from  receiver  3 during  that  interval. 

5.  OPTIMAL  RECEIVER  SELECTION.  The  optimal  receiver  selection 
procedure  chooses  a set  of  M existing  receiver  sites  which  minimizes 

N 

(27)  c = E w.tr  cov(x.) 

. . i l 

i=l 

The  quantity  c may  be  interpreted  as  a weighted  sum  of  the  error  esti- 
mates which  would  be  obtained  if  loop  range  change  observations  of  a 
nominal  flight  path  from  a set  of  m receivers  were  processed  with  the 
DOVAP  batch  processor.  The  choice  of  m (10-20)  receivers  from  the  650 
receivers  presently  available  at  WSMR  is  a very  large  combinatorial  pro- 
gramming problem  which  is  prohibitive  from  a computational  point  of  view. 
Even  if  the  number  of  available  receivers  is  considerably  reduced  by  some 
intelligent  screening  procedures,  the  magnitude  of  the  problem  is  still 
prohibitive.  Rather  than  trying  to  select  the  set  of  receivers  which 
achieves  the  global  minimum  of  c we  will  use  an  IP  improvement  algorithm 
to  obtain  a local  minimum  of  c. 

The  IP  improvement  algorithm  selects  receivers  from  an  instrumenta- 
tion planning  pool  (IPP)  . The  IPP  is  a set  of  receivers  which  is  obtain- 
ed from  the  set  of  all  existing  sites  by  placing  constraints  on  the  track- 
ing elevation  of  a receiver  and  on  the  transmitter  reference  signal 
strength  available  at  a receiver  site.  The  following  constraints  are 
used  to  develop  the  IPP.  Let  Ta=l  if  the  ath  receiver  can  receive  a 
sufficiently  strong  36.2  MHz  reference  signal  from  the  transmitter.  Other- 
wise, TcrO.  The  values  of  Ta  for  all  receivers  and  transmitters  were 
furnished  by  the  DOVAP  instrumentation  system  personnel  from  tests  con- 
ducted to  determine  signal  strength.  The  72.4  MHZ  antennas  used  at  the 
DOVAP  receiver  sites  may  have  nulls  at  low  and/or  high  elevation  angles 
of  the  incoming  72.4  MHZ  signal.  Thus,  in  order  to  insure  a sufficient 
72.4  MHZ  signal  strength  at  the  receiver,  the  elevation  angle  of  the  line 
of  sight  between  the  receiver  and  the  trajectory  should  be  between  0q  and 
02-  The  minimum  01  and  the  maximum  angle  02  are  a function  of  range  be- 
tween receiver  and  target.  For  each  receiver  a let  RGa  be  The  shortest 
ground  range  for  any  of  the  points  on  the  nominal  trajectory. 


(28) 
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where  (xi,yi,Zi)  i=l,N  are  the  coordinates  of  the  trajectory  points  and 
(xRa>>'Ra> zRa)  are  t^‘e  coor^inates  of  the  a^h  receiver.  Let  Ea  be  the 
elevation  of  the  line  of  sight  from  ath  receiver  to  the  trajectory  point 
corresponding  to  RGa. 


(29) 


E = tan 
a 


-1  ^Zi~z^cP 


Ja 


The  IPP  is  given  by 

(30)  IPP  .-falVi.SjiE^ep 


Let  M be  the  maximum  number  of  DOVAP  receiver  sites  we  wish  to 
select  and  let  Mj<M  be  the  number  of  receivers  used  to  start  the  selection 
procedure.  Mi  may  be  considered  as  the  minimum  number  of  receivers  to  be 
selected.  Given  an  arbitrary  initial  IP  having  Mi  receivers  the  SELECT 
program  will  construct  an  IP  having  Mi+1  receivers  by  adding  the  receiver 
from  those  available  in  the  IPP  which  results  in  the  greatest  decrease  in 


N 

c = Z w.tr  covfx.) 
i=l  1 


The  SELECT  program  then  deletes  the  receiver  from  this  modified  IP  which 
results  in  the  smallest  increase  in  c.  This  exchange  procedure  between 
the  IPP  and  the  IP  is  continued  until  no  further  improvement  can  be  made. 

The  exchange  procedure  terminates  when  the  received  added  to  the  IP  is 

identical  to  the  receiver  deleted  from  the  IP.  The  final  set  of  receivers 

in  the  IP  is  the  best  set  of  Mj  receivers.  To  form  the  best  set  of  Mi+1 

receivers  start  with  the  IP  formed  by  adding  the  receiver  added  and 
deleted  at  the  termination  of  the  previous  stage  to  the  best  set  of  Mi 
receivers.  Proceed  with  the  exchange  process.  Similarly,  the  best  IP!s 
having  Mj  + 2,  Mi  + 3,  • • * , M receivers  are  obtained.  The  following  flow 
chart  may  clarify  the  above  description  of  the  instrumentation  plan 
improvement  algorithm . 
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ADO  RECVR  CORRESPONDING  TO  S+  TO  IP  AND 
REMOVE  6+  FROM  IIP 


COMPUTE  COVARIANCE  MATRIX  A*'  AND  tiA‘1 
FOR  NEW  IP 
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PROVING  PROGRAMS  CORRECT 


ELWOOD  D.  BAAS 

ARMY  MISSILE  TEST  AND  EVALUATION 
WHITE  SANDS  MISSILE  RANGE,  NEW  MEXICO  88002 

ABSTRACT 

Mathematical  methods  o£  proving  programs  correct  have  recently 
been  investigated  by  several  interdisciplinary  groups.  This 
effort  has  been  motivated  by  the  fact  that  some  computer  applica 
tions  are  being  restricted  in  important  areas  because  of  the 
inability  to  design  and  implement  software  programs  which  can  be 
shown  convincingly  to  be  correct.  It  is  also  recognized  that 
debugging  and  maintaining  computer  programs  are  two  very  serious 
and  costly  problems  facing  the  computer  industry.  This  paper 
provides  a survey  of  the  investigative  work  completed  to  date 
and  some  new  program  design  concepts  which  have  resulted  from 
the  effort. 
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PROVING  PROGRAMS  CORRECT 


Proving  programs  correct  (or  program  correctness)  can  rightly 
be  considered  as  part  of  what  is  currently  referred  to  as  structured 
programming.  The  other  areas  related  to  structured  programming  which 
are  discussed  in  the  literature  could  be  categorized  as:  programming 

methodology,  program  notation,  and  program  verification.  It  is  within 
this  general  context  that  we  shall  address  the  topic  of  program 
Correctness . 

Using  20/20  hindsight,  it  is  obvious  that  during  the  first  25 
years  of  programming  there  was  too  little  emphasis  placed  on  program 
correctness  and  too  much  emphasis  on  debugging.  As  the  size  and  cost 
°f  large  computer  programs  burgeoned  during  the  past  5-10  years,  more 
and  more  effort  was  expended  in  trying  to  develop  a degree  of  structur- 
al integrity  along  with  a given  computer  program.  In  most  cases  this 
Search  for  a viable  structure  was  motivated  by  a very  practical  reason: 
program  debugging  was  taking  so  long  that  the  program  or  system  became 
obsolete  before  the  program  was  checked  out! 

At  present  there  are  over  one  hundred  people  (worldwide) , mainly 
in  the  areas  of  Mathematics  and  Computer  Science,  who  are  actively 
engaged  in  the  areas  of  proving  assertions  about  programs.  Some  of 
the  primary  methods  of  formal  program  proof  construction  currently 
under  investigation  are: 

(1)  Inductive  Methods 

(2)  Calculus  Schemata 

, (3)  Graph  Techniques 

(4)  Recursive  Schemes 

(5)  Radix  Sorting  Techniques 

(6)  Algebraic  Models 

These  methods  involve  the  translation  of  the  program  to  higher  level 
semantics  which  then  can  be  used  to  verify  the  coding  logic  independent 
of  specific  inputs  and  outputs.  Thus  one  is  concerned  not  so  much  at 
how  a program  changes  values  of  variables , but  instead  at  how  relations 
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amoving  variables  remain  the  same.  At  the  higher  level  construct, 
Precise  definitions  of  completeness,  consistency  or  correctness  can 
be  applied  to  draw  conclusions  concerning  the  original  program  and 
its  input-output  relations.  Strong  and  Walker  of  IBM  summarize  this 
basic  notion  as  follows:  "Most  approaches  to  proving  programs' 

correct  concentrate  on  the  verification  of  input-output  relations, 
the  verification  techniques  being  independent  of  any  particular  attri- 
butes of  the  input-output  relation  in  question. "•! 

One  should  note  at  this  point  that  the  concept  of  formally  prov- 
ing a program  correct  is  radically  different  than  the  usual  process 
of  testing  or  debugging  a program.  Testing  can  prove  that  a program 
is  incorrect,  but  no  reasonable  amount  of  testing  can  ever  prove  that 
a non-trivial  program  will  be  correct  for  all  allowable  variations  of 
input  conditions.  Professor  Dijkstra  states  the  case  clearly:  "Pro- 

gram testing  can  be  a very  effective  way  to  show  the  presence  of  bugs, 
but  is  is  hopelessly  inadequate  for  showing  their  absence, 

There  are  some  serious  limitations  to  these  formal  correctness 
methods  as  far  as  practical  applications  are  concerned.  First,  there 
is  no  universal  higher  level  scheme  at  present;  each  construct  is 
dependent  on  the  particular  program  language  used.  Thus  each  correct- 
ness scheme  must  be  custom  built  around  a given  language.  In  fact, 
Ashcroft,  has  shown  that  the  limitations  and  possibilities  of  correct- 
ness methods  are  just  those  of  language  definition  methods.  As  a 
practical  example  of  a limitation,  consider  the  frequent  question 
concerning  program  proofs:  "What  about  overflow?"  The  answer  is 

either,  "this  particular  method  is  defining  a simple  language  with 
idealized  storage  registers"  or  "show  me  a language  definition  that 
considers  overflow  and  I will  construct  a correctness  method  which 
takes  care  of  it."^ 

A second  limitation  of  proving  assertions  about  programs  has  to 
do  with  program  size.  As  the  programs  become  larger,  the  proofs  become 
lengthy.  When  the  techniques  of  proof  become  long  and  detailed,  they 
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mav  themselves  contain  errors. .Tones  comments  nn  this  nrnhlgmi 


APPENDIX 


Dutch  National  Flag  Problem 


(This  programming  problem  and  solution  were  presented  by  Professor 
Dijkstra  at  the  Conference  on  Programming  Methodology  held  at  the 
University  of  New  Mexico,  Albuquerque,  NM,  Jan  7-H,  1974.) 

Problem:  Each  of  N buckets  contains  a single  red,  white,  or  blue 
pebble.  Using  a function  swap  (p,q)  which  permutes  pebbles,  and 
a function  look  (p)  which  identifies  the  color  of  a pebble,  write 
a program  which  will  rearrange  the  pebbles  so  that  red,  white,  and 
blue  are  separated  into  three  bins. 

Restrictions:  Each  pebble  may  be  looked  at  only  once.  Missing 

colors  are  allowed.  No  arrays  are  allowed. 

Solution: 


T T T 


r w b 


Initial  conditions:  r:  = 1;  w:  = N,  b:  = N 

Program:  do  w>r — > 

col : = look  (w) ; 

if  col  = red->  swap  (r  ,w) ; inc  (r)  Q 
col  = white— > dec  (w)Q 
col  = blue— > swap  (w,b) ; dec  (w,b) 
fi 
od 


Note:  This  program  is  machine  independent.  To  prove  this  program 

correct  one  needs  to  simply  check  that  the  "or  statements"  are 
exclusive  and  that  the  program  terminates. 
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ABSTRACT 

Conditions  for  generalized  plane  strain  in  an  elastic,  perfectly 
plastic  cylinder  subject  to  uniform  internal  pressure  are  derived  from 
specific  assumptions  concerning  the  displacements  and  the  end  condi- 
tions. Specifically,  we  assume  the  tangential  displacement  is  a 
function  of  the  radius  only,  and  the  axial  displacement  is  a function 
of  the  distance  from  the  diametral  plane  of  reference.  We  conclude 
immediately  the  shear  strains  and  rotations  are  zero.  The  shear 
stresses  are  also  zero  if  the  cylinder  is  free  of  residual  stresses 
in  its  original  state.  Symmetry  of  the  normal  stresses  follow  from 
arguments  involving  the  Prandtl-Reuss  flow  equations,  the  Von  Mises 
yield  condition,  and  the  elastic  behavior  of  the  plastic  zone  when 
subject  to  hydrostatic  pressure.  The  equilibrium  equations,  boundary 
conditions,  and  end  conditions  are  also  involved.  The  axial  strain 
is  found  to  be  independent  of  r and  z.  Three  equations  governing 
plastic  flow  are  derived  which  are  similar  in  form  to  those  given  by 
Prager  and  Hodge  for  the  plane  strain  condition.  These  equations  were 
solved  numerically  by  integration  along  the  characteristics.  An 
iterative  procedure  was  required  to  determine  the  radial  pressure  and 
axial  strain  at  the  elastic  plastic  interface  in  a manner  which  satis- 
fied both  boundary  and  end  conditions.  Numerical  results  are  presented 
for  a cylinder  with  a wall  ratio  of  two  for  both  open  end  and  closed 
end  conditions.  The  residual  stresses  which  remain  after  release  of 
the  hydraulic  pressure  were  also  calculated.  Axial  as  well  as  circum- 
ferential residual  stresses  are  produced.  Calculations  for  wall  ratios 
in  the  range  1. 5-3.0  were  carried  out.  Re-yielding  at  the  inner  sur- 
face will  occur  if  the  wall  ratio  exceeds  2.25;  the  stresses  remain 
elastic  for  smaller  wall  ratios  when  the  pressure  is  released. 


« 
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I.  INTRODUCTION 


In  this  paper  we  consider  axial,  circumferential,  and  radial 
stresses  in  an  elastic,  perfectly  plastic  cylinder  pressurized  inter- 
nally until  the  entire  cylinder  undergoes  plastic  yielding.  It  is 
assumed  the  cylinder  is  in  a condition  of  generalized  plane  strain,  the 
ends  of  the  cylinder  are  either  open  or  closed,  and  appropriate  boundary 
conditions  apply  at  the  cylindrical  surfaces.  The  mathematical  treat- 
ment is  a generalization  of  the  plane  strain  analysis  of  Prager  and 
Hodge.  The  numerical  analysis  is  based  on  integration  along  the  char- 
acteristics of  a system  of  quasi-linear,  hyperbolic,  partial  differential 
equations.  The  residual  stresses  which  exist  when  the  hydraulic  pressure 
is  released  are  also  calculated. 

The  analysis  begins  with  three  assumptions  concerning  the  dis- 
placement field  in  a long,  uniformly  pressurized  cylinder  originally 
free  of  residual  stresses,  and  deduces  the  nature  of  the  stress  and 
strain  fields  which  are  consistent  with  these  assumptions.  As  usual, 
boundary  conditions,  equilibrium  and  compatibility  equations,  and 
equations  giving  the  strains  as  functions  of  the  stresses  or  displace- 
ments lead  to  the  required  stress  and  strain  fields  in  the  elastic  case. 
Initial  conditions,  the  yield  condition,  and  equations  governing  plastic 
flow  are  essential  additional  requirements  for  the  plastic  zone.  The 
solution  obtained  from  these  relations,  together  with  the  original 
assumptions  concerning  the  displacements,  yields  a mathematical  solution 
which  is  internally  consistent.  Conditions  of  confined  plasticity  are 
assumed;  that  is,  the  plastic  strains  are  of  the  same  order  of  magnitude 
as  the  elastic  strains,  and  can  be  derived  from  the  displacements  in 
the  same  manner.  For  simplicity,  infinitesimal  analysis  is  used  through- 
out, perhaps  at  some  cost  in  realism  when  one  considers  the  significant 
elastic  strains  which  can  occur  in  a pressurized  cylinder  of  very  high 
strength  steel. 


II.  GENERALIZED  PLANE  STRAIN  IN  AN  ELASTIC  CYLINDER 

In  an  elastic,  isotropic  cylinder,  we  assume:  the  tangential  dis- 
placement is  zero,  the  radial  displacement  is  a function  of  the  radius 
only,  and  the  axial  displacement  is  a function  of  the  distance  from 
the  diametral  plane  of  reference. 

v = 0 (1) 

u = f(r)  (2) 

w = g(z)  _ (3) 

The  displacement -strain  relations,  equations  of  equilibrium,  and 
stress-strain  relations  are  used  to  show  that  the  stresses  and  strains 
are  axially  symmetric  and  torsion-free. 
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The  shear  strains  are  given  by 


9u  9v  v 

Yr0  r98  + 9r  r 

(4) 

9u  9w 

Yrz  9z  + 9r 

(5) 

_ 3v  1 3w 

Yz0  9z  + r 36 

(6) 

On  referring 

to  Eqs.  (1)  - (3),  we  see 

= 0 *1.  = 0 ^ = 0 
9r  J 30  ’ 9z 

(7) 

= o — = o 

90  ’ 9z 

(8) 

1^=0  — = 0 
96  * 9r 

(9) 

so  that 

yQ=0,  y =0,  yq  = o. 

r0  rz  z0 

(10) 

The  stress-strain  relations  for  shear  are 

Tr0  ^Yr0 ’ Trz  ^Yrz’  Tz0  ^Yz0 

(ID 

so  that 

T . = 0,  X =0,  T = 0 . 

r0  rz  z9 

(12) 

Eqs.  (7) thru  (12)  show  that  the  cylinder  does  not  undergo  shearing 
strains  due  to  torsion. 

Next,  we  use  the  stress-strain  laws  to  show  the  stresses  and 
strains  are  axially  symmetric.  The  normal  strains  are  given  by 

9u  u _ 

er  " 9r  » e0  = r ’ ez  " 9z  ? U (13) 

since  |^-  = 0.  The  stress-strain  equations  for  the  normal  stresses 
are  given  by 


1.  Timoshenko,  S.  and  Goodier , I.N, 3 Theory  of  Elasticity,  second 
edition,  McGrow  Hill  Book  Company,  Inc.,  New  York,  1951.  Pages 
305,  306. 
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(14) 


°r  = K^e0+er+ez^  + 3 G(2er"e0"ez-) 


°0  = + 3 G(2e0-ez'er) 


a = K(e„+e  +£)+■=■  G(2e  -e  -e  ) 
z v 0 r zJ  3 v z r 6' 


On  referring  to  Eqs.  (7),  (8),  and  [9),  we  see 


9e  9en  9e 

_Z=  0 -1=0  -1=0 

90  u>  90  u’  90 


and  consequently, 


The  normal  strains  and  stresses,  as  well  as  the  displacements, 
axially  symmetric.  We  also  find  that 

9 £«  9e  9 c 

_1=  o o -l=o 

9z  ’ 9z  ’ 9r  * 

The  equations  of  equilibrium  are  required  in  order  to  complete 
solution.  We  have 


9xr0 

9x 

rz 

a - a. 
r 0 

90  + 

9z 

r 

. 9x0z 

9a 

z 

i 

T 

rz 

• 90  + 

9z 

T 

r 

. 9°0 
r-r-  + 

3t 

6z 

+ 

n 

CD 

fH 

H 

Ol 

9r  r 90  9z 


which  reduce  to 


9o  o — o. 

— + — = 0 

9r  r 
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We  see  that  Eq.  (23)  is  redundant,  as  it  duplicates  information  given 
in  Eq.  (18) . 


From  Eq.  (16)  and  Eq.  (22)  we  find 

9e 


9z 


= 0 


so  that 


w = C + Dz  . 


(24) 


(25) 


An  equation  for  the  radial  displacement  is  found  from  Eqs.  (13), 
(14) , (15)  and  (21) . 


9r 


l 1 dll  U n 

: 1 i 5 "2  ° 0 

(26) 

r 

u = Ar  + B/r 

(27) 

0 = A + B/r2 

(28) 

= A - B/r2 
r 

(29) 

e = D . 
z 

(30) 

and  from  Eq.  (25) 


The  Lame'  formulas  for  the  stresses  are  obtained  from  Eqs.  (14),  (15), 
and  (16) 


where 


D' 


a = A'  + B'/r 
r 

a = A'  - B'/r' 

□ 


a = D' 

z 


K(2A+D)  + j G(A-D) 
B'  = -2GB 


K(2A+D)  - j G(A-D) 


(31) 

(32) 

(33) 

(34) 

(35) 

(36) 


The  constants  must  be  determined  from  the  boundary  conditions  which 
may  be  stated  either  in  terms  of  stresses  or  displacements.  We  also 
find 

9o 


9r 


0, 


9(Va0) 

3r 


= 0 


(37) 
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The  stress-strain  relations  given  by  Eqs.  (14),  (15)  and  (16) 
are  not  valid  in  an  elastic-plastic  cylinder,  so  a more  detailed  anal- 
ysis is  required  to  prove  the  stresses  and  strains  are  independent  of 
z and  0.  Eq.  (37)  is  also  invalid  for  a long  cylinder  when  plastic 
flow  occurs. 

The  existence  of  residual  stresses  resulting  from  prior  thermal  or 
mechanical  history  does  not  affect  the  preceding  formulas  provided  the 
total  stresses,  due  to  residual  stresses  and  due  to  loading,  remain 
within  the  elastic  limit.  We  assume  the  stresses,  strains,  and  dis- 
placements given  above  are  produced  by  external  loads  only. 


III.  GENERALIZED  PLANE  STRAIN  IN  AN  ELASTIC-PERFECTLY 

PLASTIC  CYLINDER 

Consider  a long,  hollow  cylinder  pressurized  internally  until  a 
plastic  zone  develops  near  the  inner  surface,  as  shown  below  in  Figure 
1.  We  neglect  conditions  near  the  ends  where  hydraulic  seals  and 
various  mechanical  attachments  will  produce  a complicated  two-dimensional 
stress  distribution.  A few  diameters  away  from  the  ends,  these  local 
effects  will  disappear,  and  the  assumed  conditions  governing  the  dis- 
placements, given  by  Eqs.  (1),  (2),  and  (3)  of  the  preceding  section, 
will  closely  approximate  the  actual  physical  conditions. 


NOTE:  EXTERNAL  CYLINDRICAL  JACKET  NOT  SHOWN. 


FIGURE  1.  Cylinder  Deformed  Plastically  by  Internal  Hydraulic 
Pressure 
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Eqs.  (7),  (8) , (9)  and  also  (13)  of  the  preceding  section  are 
valid,  as  they  do  not  involve  the  stress-strain  relations  or  the  yield 
condition.  Additional  analysis  is  required  to  show  that  the  shear 
stresses  are  zero. 


The  Prandtl-Reuss  flow  equation  is  assumed  to  govern  the  deformation 
in  the  shear  during  plastic  flow 


Gy 

r 

T + 

At 

(39) 

rz 

rz 

rz 

Se  ’ 

i 

T + 

Z0 

Se 

(40) 

Sr  = 

V + 

iTer  • 

(41) 

In  these  equations,  A is  an  unknown  parameter  depending  on  time  and  the 
coordinates  r,z,  and  0.  Since  the  shear  strains  are  zero,  we  have 


t + 
rz 


At 


rz 


At 


Z0 


At 


Gr 


= 0 
= 0 
- 0 . 


On  solving  we  find 


rz 


= a e 
rz 


-6 


where 


(42) 

(43) 

(44) 

(45) 

(46) 


We  assume  the  cylinder  is  free  of  residual  stresses  at  the  beginning 
of  pressurization.  Then  the  shear  stresses  will  be  zero  while  the  tube 
remains  elastic  and  during  incipient  plastic  flow.  If  we  measure  time 
from  the  beginning  of  plastic  flow,  we  have 

t =0,  t Q = 0,  rQ  = 0 when  t=0.  (47) 

rz  ’ z0  ’ 0r 


2 . Prager  , W.  and  Hodge,  Jr.  3 P.  G. , Theori[  of  Per  feet  ly  Plastic  Solids , 
Dover  Publications,  Hew  York,  1968.  Pages  16-32,  95-122. 

* In  this  section  the  superimposed  dot  means 
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Now  A is  a positive  scalar,  so  A>0  when  t>0.  Hence,  the  exponential 
factor e"*5  in  Eq.  (45)  cannot  vanish.  It  follows  that 


rz 


= 0, 


Z0 


0, 


0r 


= 0 


(48) 


and  consequently 


t =0,  x „ = 0,  xQ  = 0 for  t$0  . (49) 

rz  z0  0r 

Eq.  (49)  corresponds  to  Eq.  (12)  of  the  first  section.  Eqs.  (21), 

(22)  and  (23)  are  also  valid  since  the  shear  stresses  are  zero;  these 
equations  are  repeated  below  for  convenience. 
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As  mentioned  previously,  we  cannot  use  the  elastic  stress-strain  laws 
to  prove  the  stresses  and  strains  are  independent  of  z and  0.  However, 
if  we  add  Eqs.  (14),  (15)  and  (16),  we  find 


a + oa  + a = 3K(efl+e  +e  ) , (50) 
r0z  v 0 r z * v 

This  relation  between  the  sum  of  the  normal  stresses  and  the  sum  of 
the  normal  strain  is  valid  in  both  the  elastic  and  plastic  zones,  as 
it  does  not  involve  the  deviatoric  stresses  and  strains. 


The  first  equilibrium  equation  gives 

3o 

r 

oa  = r — — + a 
0 3r  r 

and  from  the  displacement  strain  we  have 

u 3u 

e0  r*  £r  ~ 8r 


so  that 


r 


+ ro 

z 


3K(r 


3u 

3r 


+ u + r 


3w 

3z 


) . 


(51) 


(52) 
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On  integrating  with  respect  to  r,  we  find 

r2ar  + Jra^dr  = 3K(ru+ J'r  dr) , 


Integration  by  parts  gives 


/ 


9w  , 1 2 9w 

r 97  dr  = 2 r 9?  - 2 


1 f 2 92w 

2 JT  9r9z 


dr 


The  last  integral  vanishes  since  we  assume 

9w 


9r 


= 0 , 


Hence, 


T\  +/razdr  = 3Kfru  + \ r2  * 


From  the  boundary  conditions 

o = - p.  at  r = a 
r ri 


(53) 


(54) 


(55) 


o =0  at  r • b 
r 


which  is  valid  for  the  part  of  the  cylinder  between  the  seals,  we  find 

pia2  + f razdr  = 3K^ru)  I + J (b2-&2)  -§7  ’ (56) 

•'a  a 

2 

Evidently  p.a  is  independent  of  z,  The  integral  is  proportional  to 
the  net  axial  load  applied  at  the  ends  of  the  cylinder,  and  must  be 
independent  of  z from  equilibrium  conditions.  The  term  on  the  right 
hand  of  the  equation  involving  u is  also  independent  of  z.  Hence, 


9w 

9z 


D 


and  the  axial  strain  is  independent  of  z.  The  conditions 


0 


define  the  conditions  for  generalized  plane  strain. 
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On  returning  to  Eq.  (50) , we  have 


so  that 


u 9u  . 

a + cta  + a =3K(— +-r— +e) 
r 0 z v r 9r  zJ 


3(or+oe) 

9z 


= 0, 


(57) 


We  use  the  Von  Mises  yield  condition  to  prove  that  a and  aQ  are 
independent  of  z.  In  the  absence  of  shear  stresses  the  Von  Mises  yield 
condition  is  given  by 

(c0-ar)2  + (°r-az)2  + (oz-o0)2  = 6k2  (58) 


Eq.  (58)  is  assumed  to  be  valid  for  the  entire  plastic  zone.  In  this 
equation,  the  normal  stresses  include  residual  stresses  as  well  as 
stresses  produced  by  the  external  forces  on  the  cylinder.  We  must 
assume  that  there  are  no  residual  stresses  when  the  cylinder  is  in  its 
original,  unloaded  state;  otherwise  the  use  of  the  Lame'  formulas  to 
calculate  stresses  at  the  elastic-plastic  interface,  to  be  discussed 
later,  would  be  invalid. 

°T  = °>  °Q  = °>  °z  = 0 (59) 


when  the  cylinder  is  in  its  original  elastic  state  and  not  subject  to 
external  loads. 

Differentiate  Eq.  (58)  with  respect  to  z;  we  find 


9a  9a 

t2VV°i>  sT  * (2or-ae-°z)  if  = 0 

9a 

since  — =0.  On  combining  Eqs.  (57)  and  (60),  we  find 


(60) 


9a,. 


9a 


(a  -a.)  — r — = 0,  (aQ-a  ) = 0 

r 0 9z  ’ v 0 r 9z 


(61) 


We  have  two  alternatives.  If 


then 


°e  * V 


30e 

9z 


9a 


0, 


9z 


(62) 
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If  on  the  other  hand 


oa  = a , 
0 x 


then  Eq.  (58)  gives 


°r  - °z  = ty%  k>  a0  - °z  = £vT k 


(63) 


3c 

so  that  Eq.  (62)  is  again  satisfied  since  — ^ =0.  Hence,  Eq.  (62)  is 
satisfied  under  all  conditions. 


A similar  line  of  reasoning  can  be  used  to  show  the  stresses  are 
axially  symmetric.  We  have  from  Eq.  (50) 


3(a  +c  ) 
z r 

36 


= 0 


Eq.  (58)  gives 


3a  3a 

(2az  ■ ar  " °e->  ~W  + (2ar  - °e  - az}  if  = 0 


(64) 


(65) 


Eqs.  (64)  and  (65)  give 


3a  3a 

(az  - ar)  if-  = °"  tar  • if  ■ 0 


(66) 


and  we  finally  obtain 


3a 

z 

30 


= 0, 


3a 

r 

30 


= 0 


(67) 


To  summarize,  if  u = f(r),  w = f(z),  and  v = 0,  then  the  stresses 
and  strains  are  independent  of  9 and  z,  provided  the  cylinder  is  stress 
free  in  its  initial  unloaded  state.  A square  element  in  an  r,z  plane, 
with  sides  parallel  to  the  r and  z axes,  becomes  a rectangle  with  sides 
having  the  same  orientation^  The  rotations  are  zero,  as  can  be  seen 
from  the  following  formulas  . 


OJ 

r 


U) 

z 


1_  A Tw  _ 3v 
2 '•r  30  " 3V 

1 ,3u  3w. 

2 ^3z  3r^ 

1 , 3v_  v _ 1_  3u. 
~2  ^3r  + r r 30^ 


(68) 


3.  Love 3 A.  E.H.j  A Treatise  on  the  Magthemat^  Theory  of  Elas tiaityj 
Dover  Publications 3 New  Yorkj  1944.  Page  66. 
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On  referring  to  Eqs.  (1),  (2)  and  (3)  we  see  that 

wr  s 0,  = 0,  wz  = 0 (69) 

The  type  of  displacement  which  occurs  in  the  r,z  plane,  in  both 
the  elastic  and  plastic  zones,  is  shown  in  Figure  2. 


ALLOWABLE 
TRANSLATION 
AND  NORMAL 
STRAINS 


NOT 

ALLOWABLE 
ROTATIONS 
ARE  ZERO 


NOT 

ALLOWABLE 
SHEAR  STRAINS 
ARE  ZERO 


FIGURE  2,  Deformation  in  a Pressurized  Elastic  Perfectly  Plastic 
Cylinder 

We  have  analyzed  the  general  nature  of  the  stresses  and  strains 
in  some  detail,  as  this  aspect  of  the  autofrettage  process  was  the 
subject  of  some  discussion  during  an  investigation  of  a premature  in  a 
175mm  gun. 


IV.  DERIVATION  OF  THE  BASIC  EQUATIONS 

We  now  show  that  the  Prager-Hodge  equations  for  plastic  flow  in  a 
cylinder  under  plane  strain  conditions  may  be  modified  to  account  for 
a non- zero  axial  strain,  Solution  of  the  revised  flow  equations  does 
not  lead  to  a complete  solution  of  the  problem;  in  essence,  we  must 
determine  conditions  at  the  elastic-plastic  interface  that  will  give 
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the  correct  axial  load.  An  iterative  procedure  is  required,  since  the 
axial  load  cannot  be  determined  until  the  integration  of  the  flow 
equations  has  been  completed. 

The  following  considerations  enter  into  the  derivation  of  the 
basic  equations: 


1)  Compatibility  relations,  leading  to  the  Prager-Hodge  strain 
relation. 

2)  Equilibrium  conditions. 

3)  Compressibility  relation. 

4)  Von  Mises  yield  condition. 


5)  Prandtl-Reuss  flow  equations. 

6)  Continuity  relations  at  the  elastic-plastic  interface,  r=p . 

7)  End  conditions  involving  the  axial  strain  or  axial  load. 

Items  6 and  7 are  closely  related;  the  need  to  satisfy  both 
conditions  when  the  axial  load  is  prescribed  leads  to  an  iterative 
procedure,  as  the  axial  load  cannot  be  calculated  until  integration 
of  the  flow  equations  has  been  completed. 


The  compatibility  relation  for  the  strains  is  expressed  in  terms 
of  the  Prager-Hodge  strain  function  and  the  axial  strain  deviation. 

We  have 


<f>  = 


I (Er_£0) 


1_  r9u.  u 
3 ^9r 


9(f) 

9r 


1_  .9  u 1 3u  u . 

3 l7~2  " r 3r  + V 
9r  r 


e = -=-  O + £A+e  ) 
3 v r 6 


e 

r 


e 

r 


® > 


= " e> 


e = e - e 
z z 


(70) 


e 

z 


-£  -£ 
r 


(71) 


635 


and  finally 


cjp  2 

z _ J_  , 9 u 1 3u  u_ 

3r  " 3 " 2 " r 3r  + 2 J 

9r  r 


34>  + ^ z _ 2<j> 

9r  9r  * ’ r 


(72) 


The  strain  deviations  e.  and  e can  be  expressed  m terms  of  e 

Or  r z 

and  4>.  We  find 


6 9 


(3<fi+ez) 


(73) 


e 

r 


- j c*-<g 


(74) 


These  relations  are  required  in  the  derivation  of  the  flow  equation. 

The  equilibrium  equation  is  written  in  terms  of  stress  and  strain 
deviations.  We  have 

s = I (vvV  (75) 

a = s+s  , an  = s+s.  . a = s+s  (76) 

r r*  9 9 5 z z K J 


so  that  Eq.  (21)  becomes 


But 


= 0 


s = 3Ke 


or 


9s 

9r 


3K 


9e 

9r 


9 s 
9r 


3K 


9e 

z 

9r 


(77) 


(78) 


(79) 


We  eliminate  sQ  by  using  the  yield  condition.  We  note  that 

s = - s„  - s (80) 

z 0 r K J 
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so  that  Eq.  (58)  becomes 


2 2.2 
sn  + sAs  + s = k 
0 6 r r 


S6  2 Sr  - SK 


where  for  brevity  we  have  written 


s = V 4k2  - 5s  2 
K r 


The  ambiguous  sign  is  determined  by  conditions  at  the  inner  surface 
during  incipient  plastic  yielding.  Since  the  stresses  are  continuous 
across  the  elastic-plastic  interface  the  Lame'  formulas  are  correct. 
Let 


We  note  that 


a = inner  radius  of  cylinder 

p = radius  of  the  elastic  plastic  interface 

b = outside  radius 

a = -pn,  a =a„  at  r=a 
r z 0 

a = -p. , o =a.  at  r=p 
r rl*  z 1 

a = 0,  a =a„  at  r=b 

r z 2 


since  the  tube  is  elastic  in  the  zone  p<r<b.  The  Lame'  formulas  give 


a 


a 


0 


0 


r,  2 2. 

p1(b  +p  ) 

T2  2 

b -p 

2 2 
P0Cb‘ +Z) 

72  2 

b -a 


at  r=p 


at  r=a 


(81) 


(82) 


We  consider  the  state  of  stress  for  incipient  plastic  yielding  at  r=a. 
On  substituting  the  appropriate  values  of  a , a , and  o into  Eq.  (58), 
we  find  r 2 


(b2-a2)  - 2o0p0  (b2-a2) 


2 r-zu4  4, 
Pa  (3b  +a  ) 


2 2 2' 
3k  (b  -a  ) 


(83) 
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i /i  2 2. 

/(b  -a  ) 


(84) 


ao  * j a2P0  i V3^2  (b2-&2)  - b4p02]  | 

Eq.  (83)  represents  an  ellipse  in  Cartesian  coordinates,  so  there  are 
two  permissible  values  of  cr^  for  each  value  of  p^  unless  the  quantity 
under  the  radical  sign  is  zero.  Prager  and  Hodge  have  shown  the  minus 
sign  should  be  used  in  Eq.  (84)  under  plane  strain  conditions.  Under 
generalized  plane  strain,  open  end  conditions 

Oq  = 0 when  r = a, 

the  minus  sign  again  prevails.  Under  closed  end  conditions,  the 
pressure  of  the  hydraulic  fluid  against  the  end  caps  produces  an  axial 
tension  stress  in  the  cylinder. 

2 2 2 

a0  = a pQ/(b  -a  ) when  r = a 

so  the  quantity  under  the  radical  sign  is  zero.  We  assume  the  minus 
sign  is  correct  under  all  conditions  of  interest  when 

a < p ^ b. 


We  can  now  determine  the  conditions  at  the  elastic-plastic  inter- 
face from  an  assumed  interface  pressure  p^. 

sz  = - 2 V 3[k2  (b2-p2)  - Pl2b4/3(b2-a2)  (85) 

sr  = - \ sz  - b2p./(b2-p2)  (86) 

VlV  b2p./(b2-p2)  (87) 


We  note  that  the  axial  stress  deviation  is  zero  or  negative  at  the 
elastic-plastic  interface.  We  assume  these  conditions  are  also  true 
in  the  plastic  zone.  Then 


s 

z 


1 1 
2 sr  * 2 sk 


s 


6 


1 


2 sr  + 2 Sk 


+ s + s = 0, 
r z * 


(88) 

(89) 
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as  required.  On  substituting  the  preceding  value  of  s&  into  Eq.  (77), 
we  obtain 


3s 

2 

3r 


- 3K 


9ez 

3r 


-3s  +s, 
r k 

2r 


(90) 


for  the  equation  of  equilibrium. 


In  writing  the  Prandtl-Reuss  flow  equations  for  the  stress  and 
strain  deviators  in  the  plastic  zone,  we  use  p rather  than  t as  the 
independent  variable.  This  substitution  is  permissible  since  p is  a 
monotonic,  increasing  function  of  t during  the  first  loading  cycle. 

We  assume,  in  summary,  that  plastic  yielding  and  flow  are  independent 
of  the  rate  of  loading;  this  is  true  to  a first  approximation  provided 
the  rate  of  loading  is  sufficiently  small.  First,  we  eliminate  X from 
the  flow  equations 


2G 


3e 

i 

3p 


3e 


2G 


0 

9p 


3s 

ap 

9se 

3p 


+ Xs 


+ Xsr 


obtaining  the  equation 

3e 

2G  ( 


3p 


3e„ 


se  ‘ 


3p  Sr^ 


!fr 

3p  se 


9se  c 

3p  Sr 


Now 


and 


er  = 2 ^3<f>_ez-) 


e6  " 2 (_;5<t>'ez) 


1 1 

c s — — S + — S 

e 2 r 2 k 


On  eliminating  e,  e^ , and  sfi  from  the  flow  equations,  we  obtain 


0 


3e  - . ..2  3s 

z . ^ 3<j>  4k  r 


G(3s  -s,)  — r—  + (3s  +3s.  ) . - . 

' r kJ  3p  v r k'  3p  s.  3p 


(91) 


where,  as  before,  we  let 


\ — o 7 

s,  = \ 4k  -3s 
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The  preceding  analysis  differs  from  the  original  work  of  Prager  and 
Hodge  mainly  in  the  use  of  e rather  than  e as  one  of  the  three  de- 
pendent variables,  and  in  the  indeterminate  conditions  at  the  elastic- 
plastic  interface.  These  interface  conditions  are  finally  determined 
by  iteration  so  that  one  of  the  following  load  conditions  at  the  end  of 
the  cylinder  is  satisified: 

ez  = 0 Plane  Strain  (Prager  and  Hodge) 

r b 

0 Generalized  plane  strain,  open  end  condition. 

2 2 2 

a p./(b  -a  ) Generalized  plane  strain,  closed  end 
condition. 


(90) , and  (91)  form  a set  of  partial  differential 
equations  in  the  independent  variables  r and  p and  dependent  variables 
<j> , e , and  s . These  equations,  the  initial  conditions,  and  the 
boundary  conSitions  at  the  ends  of  the  cylinder  complete  the  formulation 
of  this  problem.  The  solution  is  obtained  by  the  method  of  character- 
istics, as  described  in  the  next  section. 


/ ro  dr  = 

•'a  z 


r 


ro  dr  = 
z 


Equations  (72) , 


V.  THE  METHOD  OF  CHARACTERISTICS 

The  characteristics  are  found  by  determining  the  conditions  for 
which  a set  of  six  linear  equations  in  the  partial  derivatives  of  e , 
<f>  and  are  inconsistent.  The  first  three  equations  have  been  44 
derived  in  the  preceding  section;  the  second  set  of  three  equations 


is  found  from  differential  relations  among 

dr , d4>  3 

and  ds  . 
r 

3e 

z 

3<j> 

2<j) 

3r 

9r 

r 

3e 

3s 

3s  -s, 

3K  3r 

r 

3r 

r k 
2r 

(92) 

3e 

+ f _ 

2 3p 

3s 

f 2 

1 3p 

r 

3p 

= 0 

(93) 

3e 

dr 

3r 

Z z. 

* 3p  dp 

= de 

z 

(94) 

£»dr 
<i pr 

+ ~ dp 
3p 

= d(f> 

(95) 

3s 

3s 

3? 

dp 
3p  H 

= ds 

r 

(96) 
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where  for  brevity  we  have  written 


f1  = G[3srsk  - sk2]/4k2 

f2  = G[3srsk  + 3sk2]/4k2 

s.  = V 4k2  - 3s  2 
k r 

The  determinant  of  the  coefficients  of  the  partial  derivative  is 


A = 


or 


1 

3K 

0 

dr 

0 

0 


and  finally 


1 

0 

0 

0 

dr 

0 


0 

-1 

0 

0 

0 

dr 


0 

0 

£1 

dp 

0 

0 


0 

0 

£2 

0 

dp 

0 


A = [f2  - fi  - 3K]  dr  (dp)' 


A = [ (4G-3K)  k2  - 3Gsr2]  dr(dp)2/k2 


0 

0 

-1 

0 

0 

dp 


(97) 


The  factor  (4G-3K)  is  generally  negative  for  steel,  so  the  entire 
bracketed  expression  is  negative.  Hence,  if 

A = 0, 

then  dr=0  or  dp=0.  The  net  of  orthogonal  lines  r=c  , p=c  , where  the 
constants  c and  c are  arbitrary,  form  the  characteristics  for  this 
problem.  Certain  jpartial  derivatives  may  be  discontinuous  across  these 
characteristics. 

Along  the  characteristics,  the  partial  differential  equations 
are  replaced  by  ordinary  differential  equations. 
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dez  d<j>  24)  . 

— r-  + -2-  = - — along  p = c 
dr  dr  r 6 p 


de 


ds 


3k 


dr 


de 
: z 

‘1  dp 


+ f 


dr 


d(j) 
2 dp 


3S  -St 
r k 

2r 

ds 


along  p = c 


= 0 along  r = cr 


(98) 

(99) 

(100) 


These  equations  were  solved  by  a first  order  predictor-corrector  method, 
starting  at  the  elastic-plastic  interface  and  integrating  toward  the 
inner  radius  of  the  cylinder.  The  predictor  equations  are 


Ae^  + A4>  = - A r (101) 

3s  (r)-s.(r) 

3KAe  - As  = — ^ Ar  (102) 

z r 2r 

f1(p)Aez  + f2 (p) A4>  - Asr  = Ap  (103) 


and  the  corrector  equations  are 


a » . 1 r2(j>(r)  2^(r+Ar)  n 

Ae  + A$  = - 77  [ ■■----■■  + A.  i A r 

7 T ? L t*  r+Ar  J 


(104) 


3s  (r)-s.(r)  3s  (r+Ar) -s,  (r+Ar) 

3KAe  - As  = ± 0 + oV — (105) 

z r 2 L 2r  2(r+Ax) 


j [f x (p)  +f  j (p+Ap)  ] Aez  + j [f2(p)+f2(p+Ap)]  A(j)  - As^  = Ap  . (106) 


The  predictor  equations  were  used  once  and  the  corrector  equations 
twice  for  each  set  of  nodal  points  involved.  The  range  of  integration 
for  each  independent  variable  was  divided  into  at  least  twenty  equal 
intervals. 

Integration  of  the  differential  equation  for  the  net  axial  load  was 
started  at  the  inner  cylindrical  surface  using  initial  conditions  appro- 
priate for  incipient  plasticity. 

Closed  End  Condition:  pQ  = k(b2-a2)/b2  (107) 

Open  End  Condition:  p^  = k(b2-a2)/  \l  a2+3b2  (108) 


0 
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Plane  Strain  Condition:  =V3  k(b2-a2)/V  (l-2v)a2+3b2  (109) 

The  generalized  plane  strain  condition  applies  to  the  closed  end  and 
open  end  examples. 

Calculations  for  the  plane  strain  condition  were  carried  out  in  the 
manner  indicated  by  Prager  and  Hodge,  and  are  included  for  purposes 
of  comparison.  Once  the  radius  p of  the  elastic-plastic  interface  is 
chosen,  the  remaining  conditions  at  the  elastic-plastic  interface  are 
determined,  and  integration  toward  the  inner  radius  can  be  carried  out 
without  difficulty. 

Under  generalized  plane  strain  conditions,  the  pressure  at  the 
elastic-plastic  interface  is  determined  by  an  iterative  procedure. 

First,  a trial  value  of  the  interface  pressure  is  selected,  generally 
by  extrapolation  from  previous  results,  then  the  axial  stress  and  other 
quantities  are  calculated  in  sequence  as  shown  below. 


r = Pj  the  elastic-plastic  interface  (110) 

<f  = -Pp  an  estimated  trial  value  (111) 


1 2 

h(b,p,Pl)  = % 3[k2(b2-p2)  -Pl2b4] 

= (Pl2p4+3b4Pl2-3k2)/(b2-p2) (Plp2+h) 

(112) 

cr0  = Pl(b2+p2)/(b2-p2) 

(113) 

sr  = (-3b2p1+h)/3(b2-p2) 

(114) 

4 = - 2(l+v)b2Pl/3(b2-p2)E 

(115) 

e = - 2(l+v)h/3E(b2-p2) 

(116) 

The  preceding  values  of  s , <j>,  and  e were  used  as  starting  values  in 
integration  of  Eqs.  (101)^  (102),  an 5 (103). 
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The  normal  stresses  and  strains  for  each  nodal  point  were  calculated 
concurrently.  First,  we  calculate  the  axial  strain  at  the  elastic- 
plastic  interface;  according  to  our  assumptions,  the  axial  strain  is 
constant  across  the  entire  cross  section. 


[(l-2v)p2prh]/E(b2-p2), 

(117) 

a ^ r ^ b 

3 3 - 

£e  " S - 2 ez  - 7 * 

(118) 

3 3 

e = e - e + 4<f> 

r z 2 z 2 Y 

(119) 

e = e - e 
z z 

(120) 

s = 3Ke 

(121) 

s = V 4k2-3s  2 
k r 

(122) 

1 1 

s = -^Ts  "Tsi 

z 2 r 2 k 


a = s+s  , a.  = s+s.,  c = s+s 
r r5  6 0’  z z 


Finally,  the  axial  load  was  obtained  by  integration.  In  the 
elastic  region, 


az  ~ °l>  p 4 r 4 b 

and  the  plastic  region  the  axial  stress  was  given  by  the  sequence  of 
equations  given  above.  The  axial  load  is  given  by 

r 

F = 2 it/  ro  dr 
J a.  z 
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F = 0 in  the  open  end  condition 


F = 


irp^a  in  the  closed  end  condition 


If  F was  not  correct,  a new  interface  pressure  was  chosen  and  the  en- 
tire sequence  of  calculations  repeated.  The  reguli  falsi  method  for 
obtaining  the  correct  value  of  p.  worked  well  for  the  open  end  con- 
dition ’ . 


Under  all  end  conditions  we  have 

p.  ^ k(b2-p2)/b2; 


otherwise  we  have  a negative  quantity  under  the  radical  sign  in  the 
definition  of  k.  The  equality  sign  holds  under  closed  end  conditions 
when  p equals  a;  consequently,  we  find  successive  approximations  for 
p.  obtained  by  standard  methods  did  not  always  satisfy  the  above  in- 
equality. Moreover,  the  results  were  greatly  affected  by  round-off 
error.  To  overcome  these  difficulties,  the  successive  approximations 
for  p.  were  forced  to  satisfy  the  above  inequality,  and  the  results 
were  calculated  to  a high  degree  of  precision.  Details  of  the  method 
of  calculation  finally  adopted  will  be  discussed  in  a forthcoming  report. 


VI.  RESULTS  AND  CONCLUSIONS 

Extensive  numerical  and  graphical  results  were  obtained  for  a 
range  of  wall  ratios,  for  both  plane  strain  and  generalized  plane 
strain.  Results  for  a wall  ratio  two,  open  and  closed  end  conditions, 
under  generalized  plane  conditions  are  presented  in  this  paper. 
Additional  results,  including  the  effects  of  varying  Poisson's  ratio, 
will  be  presented  in  a forthcoming  report. 

The  results  for  the  open  end  condition  are  presented  in  Figures 
3 through  9.  The  existence  of  axial  stresses  is  clearly  indicated. 

We  notice  that  the  axial  stress  has  a maximum  within  the  plastic  zone 
after  the  elastic-plastic  interface  has  progressed  about  halfway 
through  the  tube.  This  result  was  unexpected,  yet  appears  to  be  con- 


4.  Hill3  R.  3 The  Mathematical  Theory  of  Plasticity 3 Oxford 3 1956. 
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5.  Hoffman3  0.  and  Sachs 3 G.3  Introduction  to  the  Theory  of  Plasticity 
for  EngjnaejrSj  McGratf  Hill  Book  Company 3 1953.  Page  92. 
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FIGURE  4.  Circumferential  Stress  Distribution,  Open  End,  Generalized 

Plane  Strain  Condition 
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r/a 


FIGURE  5.  Radial  Stress  Distribution,  Open  End,  Generalized 

Plane  Strain  Condition 
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FIGURE  6.  Residual  Circumferential  Stress  Distribution,  Open  End, 

Generalized  Plane  Strain  Condition 
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r/a 

FIGURE  7.  Residual  Radial  Stress  Distribution,  Open  End,  Generalized 

Plane  Strain  Condition 
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FIGURE  9.  Effective  Combined  Residual  Stress  as  a Function  of  Wall 
Ratio,  .Open  End,  Generalized  Plane  Strain  Condition 
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firmed  by  recent  calculations  based  on  the  finite  element  method  . The 
residual  axial  stress  is  tensile  and  equals  about  one  fourth  of  the 
residual  circumferential  stress  for  the  wall  ratio  under  consideration. 

Similar  calculations  for  open  end  conditions  were  carried  out  for 
wall  ratios  in  the  range  1. 5-3.0  inclusive.  The  corresponding  residual 
stresses  were  also  calculated,  assuming  the  re-yielding  would  not  occur 
at  the  inner  surface;  these  results  are  shown  in  Figure  8.  The  effective 
compressive  stresses  corresponding  to  the  Von  Mises  yield  condition 
were  also  calculated  to  determine  the  maximum  wall  ratio  for  which  the 
effective  compressive  residual  stress  at  the  inner  surface  was  less  than 
the  yield  stress;  after  several  iterations  a wall  ratio  of  2.25  was 
obtained,  as  shown  in  Figure  9.  If  the  entire  cylinder  yields  plastically 
when  pressurized,  then  re-yielding  will  occur  at  the  inner  surface  for 
larger  wall  ratios;  the  dotted  lines  indicate  predicted  residual  stresses 
in  the  absence  of  re-yielding.  A new  analysis  and  a corresponding  pro- 
gram would  be  required  to  calculate  the  actual  stress  conditions  under 
re-yield  conditions. 

Calculations  for  the  closed  end  condition  were  also  carried  out, 
as  shown  in  Figures  10  through  14.  The  pressure  on  the  end  caps  of 
the  cylinder  produces  a large  tensile  stress  in  the  cylinder.  For  this 
reason  the  axial  stress  is  larger  than  that  obtained  for  the  open  end 
condition,  and  increases  steadily  through  the  thickness  of  the  plastic 
zone.  In  actual  practice,  we  would  expect  the  end  conditions  would  be 
intermediate  between  the  open  and  closed  end  conditions,  as  friction  in 
the  seals  would  contribute  to  the  axial  stress.  The  analysis  of  this 
paper  could  be  modified  to  take  friction  into  account  if  we  assumed  a 
constant  coefficient  of  friction  for  all  values  of  the  internal  pressure. 

Experiments  to  confirm  the  predictions  of  the  preceding  analysis 
and  calculations  would  be  desirable.  It  is  essential  that  the  length 
to  diameter  ratio  be  large  enough  to  prevent  end  effects  from  influencing 
the  stress  and  strain  distribution  in  the  central  part  of  the  cylinder, 
where  presumably  these  quantities  would  be  measured.  Measurements  in 
both  the  circumferential  and  axial  directions  are  required  in  order  to 
fully  determine  the  stress  distribution  on  the  outside  of  the  cylinders. 
Special  instrumentation  is  required  to  obtain  measurements  of  the  strain 
at  the  inner  surface  due  to  the  presence  of  oil  under  high  pressure. 

The  Sachs  method  of  measuring  residual  stresses  can  be  modified  to 
measure  axial  as  well  as  circumferential  residual  stresses. 


6.  Chen , P.C.T.,  The  Finite  Element  Analysis  of  Elasti^^lastio  Thick- 
Walled.  TubeSj  Watervliet  Arsenal  Technical  Report  WVT-74089 3 
September 3 1974.  Page  28 , Figure  5.  This  report  contains  an  ex- 
cellent bibliography  of  recent  work3  with  emphasis  on  current  re- 
search at  Watervliet  Arsenal. 
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In  calculating  the  residual  stresses,  it  has  been  assumed  the 
material  is  elastic  and  the  stress-strain  relations  are  linear  during 
the  release  of  pressure.  However,  some  non-linearity  may  be  expected 
as  the  re-yield  condition  is  approached,  due  to  the  Bauschinger  effect. 
For  this  reason  the  actual  residual  effective  stress  may  be  somewhat 
less  than  the  calculated  value.  It  is  hoped  the  analysis  and  calcula- 
tions in  this  paper  will  make  it  possible  to  delineate  the  respective 
roles  of  the  end  conditions  and  the  Bauschinger  effect  in  producing  the 
observed  field  of  residual  stresses. 
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CHEMICALLY  REACTING  DIFFUSIVE  SYSTEMS 
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ABSTRACT 


Various  problems  occuring  in  chemical  reactor  theory  and  the  theory  of 
chemically  or  biochemically  reacting  mixtures  are  studied.  In  particular, 
we  investigate  the  processes  controlling  multiplicity  and  its  implications 
with  regard  to  ignition  and  extinction  in  a reactor  and  the  processes  by  which 
stable  oscillatory  states  are  set  up.  In  inhomogeneous  systems  with  spatially 
and  temporally  distributed  parameters  many  of  the  phenomenia  are  locally  dis- 
tributed. Various  perturbation  procedures  have  been  developed  to  analytically 
study  these  problems  and  to  simplify  numerical  procedures. 
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1.  Introduction. 


We  shall  formulate  and  describe  certain  recently- 


occurring  problems  arising  in  various  fields  such  as  the  theory  of  che- 
mical and  biochemical  reactions,  chemical  and  nuclear  reactors,  com- 
bustion, diffusion  through  membranes  and  porous  media,  Joule  heating, 
and  soil  mechanics.  We  do  not  wish  to  imply  that  any  one  field,  much 
less  all  of  them,  is  treated  globally.  In  fact,  we  wish  to  strongly  em- 
phasize the  point  that  although  the  equations  describing  the  various  pro- 
blems have  a certain  common  form,  the  parameters  and  specific  non- 
linearities  are  different,  and  the  specifics  of  each  problem  clearly  deter- 
mine the  techniques  used  and  the  results  obtained.  It  is  just  not  enough 
to  think  formally  about  a class  of  problems  of  which  the  one  of  interest 
is  a special  case.  Thus,  although  we  present  the  general  class  of  equa- 
tions for  the  above-mentioned  fields,  we  shall  present  results  only  for 
two  specific  problems  in  an  attempt  to  give  some  idea  of  the  type  of 
techniques  used  and  the  kinds  of  results  obtained.  A rather  complete 
history  of  recent  work  as  well  as  references  to  the  work  of  many  resear- 
chers can  be  found  in  the  references  [ l]  -[  5]  . 

Mathematically,  in  the  simplest  one- dimensional  geometries  all  the 
problems  consist  of  finding  the  temperature  T (x,  t)  and  the  concentra- 
tions Ck  (x,  t)  , i = 1 , . . . , N , of  the  reacting  species  at  any  point 

x at  any  time  t in  the  region  of  interest.  Allowing  for  chemical  reac- 
tion, molecular  diffusion,  and  convection,  the  governing  equations  for 


eithe  r 

adiabatic 

or 

non- 

•adiabatic 

situations 
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Here  the  represent  the  diffusion  coefficients  (which  in  some  problems 

could  depend  explicitly  on  the  C^_  and/or  T),  are  known  constants, 

and  the  F are  almost  always  of  the  form  of  polynomials  in  the  con- 
centrations  multiplied  by  the  usual  exponential  Arrhenius  tempera- 

ture dependence  (or  sums  of  such  forms).  We  shall  confine  our  atten- 
tion to  the  one-dimensional  situation  described  by  (1),  (2).  The  neces- 
sary modification  for  a multi-dimensional  problem  is  clear;  namely,  the 
diffusive  terms  become  div(D^  grad  • ) and  the  convection  terms  in- 

volve the  appropriate  gradients.  Note  that  we  need  only  the  equations 
for  (N  - 1)  concentrations  since  the  usual  stochiometric  equation  (mass 
balance)  easily  gives  the  concentration  of  the  remaining  species  once  all 
the  others  are  known. 

In  actual  reaction  problems  three  situations  all  occur  which  make 
simplification  of  the  system  of  equations  (1),  (2)  possible.  (i)  Some  of 

the  equations  completely  decouple  from  the  rest  (by  themselves  or  in 
blocks).  (ii)  Three  time  scales  usually  predominate  so  that  some  com- 
pounds are  formed  much  faster  or  much  slower  than  the  scale  of  inter- 
est. Thus,  some  compounds  are  formed  or  destroyed  so  fast  that  they 
can  always  be  considered  in  equilibrium,  and  some  compounds  can  be 
taken  to  be  constant  (often  at  non-equilibrium  values)  because  they  are  so 
slowly  reacting  and  thus  simply  undergo  a slow  drift  in  the  value  of  the 
concentration.  (iii)  Various  dimensionless  parameters  occur  in  widely 
disparate  sizes.  Thus,  a very  successful  approach  has  been  to  combine 
singular  perturbation  methods  for  both  large  and  small  values  of  various 
parameters  with  multi-time  scale  perturbation  methods.  For  problems 
in  the  theory  of  both  continuous  stirred  tank  and  tubular  chemical  reac- 
tors A.  B.  Poore  [6]  has  found  with  various  numerical  checks  that  the 


perturbation  analysis  for  large  Peclet  number  (P  — *•  oo)  provides  good 
results  for  P as  low  as  2 or  3,  and  the  analysis  for  small  Peclet 
number  (P  0)  provides  good  results  for  P close  to  unity.  Thus, 
it  is  believed  that  the  perturbation  analysis  yields  all  possible  phenomena 
which  can  occur. 

To  illustrate  the  diverse  phenomena  which  can  occur  we  will  des- 
cribe one  set  of  operating  characteristics  for  a simple  non-adiabatic  tu- 
bular reactor  for  one  range  of  the  parameters. 


Fig.  I 

Figure  1 is  the  response  diagram  (i.  e.  , the  behavior  of  the  ampli- 
tude of  the  temperature  IITII  = q^x^I  ^ ^ 1 as  a certain  parameter,  the 
Dahmkohler  number  D,  varies)  for  values  of  the  physical  parameters  in 
certain  ranges  (and  these  ranges  are  given  explicitly  by  A.  B.  Poore  [6]).  We 
shall  trace  the  process  as  D is  increased  starting  at  a small  amplitude  stable 
steady  state  for  D near  0.  The  response  moves  along  the  path  OJA. 


As  D is  increased  past  D^,  the  temperature  undergoes  a jump  to  a 
large  amplitude  stable  periodic  response  above  point  L,  and  as  D is 
further  increased,  the  amplitude  of  this  oscillation  decreases  until  the 
oscillation  vanishes  at  point  Q.  The  stable  steady  state  is  followed  up 
the  branch  through  K as  D increases  still  further.  Now,  as  D is 
decreased,  the  process  follows  the  branch  KQLMB,  the  solution  being 
steady  and  stable  from  K to  Q and  from  M to  B with  a stable 
oscillation  of  increasing  amplitude  from  Q to  M.  As  D decreases 
through  D^,  the  oscillation  just  ceases  (subcritical  branching  at  P). 

As  D is  decreased  below  Dd,  there  is  an  extinction  as  the  response 
jumps  to  point  J and  then  follows  the  path  J to  0.  The  analysis 
and  numerical  computations  to  support  this  description  are  given  in 
[7]  -[8], 

For  the  continuous  stirred  tank  reactor  and  the  simple  tubular 
reactor  it  has  been  possible  to  identify  fourteen  or  fifteen  different  re- 
sponse diagrams  depending  on  the  various  ranges  of  the  six  independent 
physical  parameters.  Briefly  stated,  many  of  the  outstanding  problems 
consist  of  finding  all  the  possible  response  diagrams  for  more  involved 
reactors  such  as  adiabatic  and  non-adiabatic  packed  bed  and  moving  bed 
reactors  perhaps  involving  higher  order  kinetics,,  An  extremely  interest- 
ing implication  of  some  of  these  distributed  type  reactors  is  the  appear- 
ance of  solutions  involving  interior  and  moving  boundary  layers  [ 10]  . 

To  illustrate  the  types  of  phenomena  currently  arising  in  problems 
involving  so-called  localized  temporal  and  spatial  instabilities  we  shall 
consider  the  following  equations: 

(3)  fjr  = A - (B+l)  uu2v+D,0 


= Bu 


U2  V + D 2 


92  v 
9x2 
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9t 
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Here  B is  a parameter  and  A(x)  is  a given  function  of  x.  These 
equations  (together  with  appropriate  boundary  and  initial  conditions)  have 
been  proposed  by  I.  Prigogine  (see  [ U]  , [12]  for  references)  as  a model 
to  describe  observed  localized  temporal  and  spatial  structures  involving 
concentration  waves  and  localized  instabilities  in  chemically  reacting  sys- 
tems. The  equations  (3),  (4)  arise  in  writing  the  conservation  laws  for 
a chemical  reaction  A + B -*  D + E through  a certain  autocatalytic  step, 
u and  v represent  the  chemical  concentrations  of  two  reactants,  and 
once  they  are  found  the  concentrations  of  all  other  products  and  reactants 
are  found  by  solving  certain  linear  parabolic  problems  by  routine  methods. 

Perhaps  the  simplest  presentation  of  some  of  the  phenomena  con- 
tained in  this  system  can  be  made  by  means  of  Figures  2 to  9 which  have 
been  taken  from  [ 11]  . The  (D2 , B)-plane  is  divided  into  three  regions 


as  shown  in  Fig.  2, 


Fig.  a Fig.  3 

For  fixed  D2  , B in  region  I there  exists  a unique  steady  state  solution 
v,  v of  the  problem,  and  these  solutions  have  the  form  illustrated  in 
Fig,  3.  As  the  values  of  Dz  and  B are  changed  so  that  we  cross 


from  region  I into  region  II,  these  solutions  become  unstable  and  the 
new  stable  steady  states  of  Fig.  4 evolve.  These  new  states  clearly 
differ  significantly 


Fig.  4 Fig.  5 

from  the  old  states  only  through  a localized  spatial  oscillation.  As  we 
cross  from  region  I to  region  III,  the  stable  steady  states  of  region  I 
lose  their  stability,  and  localized  stable  temporal  oscillations  (i.e.,  con- 
centration waves)  are  set  up  as  illustrated  in  Figs.  5-8  (which  show  the 
behavior  of  the  solution  over  half  a period).  For  fixed  x = \ , Fig.  9 
shows  that  this  oscillation  is  of  relaxation  type. 


Fig.  6 
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Fig.  7 
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Fig.  8 Fig.  9 

Thus,  as  stability  boundaries  are  crossed  the  new  stable  states 
differ  from  the  old  stable  states  only  in  local  regions.  This  localization 
is  due  to  the  non-uniform  distribution  A(x)  of  the  reactant  A.  Mathe- 
matically, this  phenomenon  is  suggested  by  the  presence  of  turning-points 
in  the  linearized  stability  equations.  The  analytical  calculations  to  sup- 
port the  structure  as  we  have  just  described  it  is  given  by  J,  A,  Boa 
[12]  by  using  various  combinations  of  WKB,  singular  perturbation,  and 
multi-time  scale  perturbation  techniques. 

The  model  we  have  just  described  is  one  of  the  few  currently  re- 
ceiving considerable  attention  in  that  it  is  a simple  reaction-diffusion  sys- 
tem with  qualitatively  accurate  descriptions  of  certain  simple  reacting 
systems.  In  particular,  it  is  thought  that  certain  aspects  of  the  Belousov- 
Zhabotinsky  reaction  (see  [ 13]  and  [ 14]  for  detailed  descriptions)  can  be 
accounted  for  with  this  model.  This  reaction  yields  relaxation  oscillations 
and  various  concentration  wave  interactions  all  easily  visible  to  the  naked 
eye  as  color  changes  in  the  reacting  chemicals.  The  results  of  N.  Kopell 


cc  n 


and  L.  N.  Howard  [ 14]  , [ 15]  and  J.  A.  Boa  [ 1 2]  and  J.  A.  Boa  and 
D.  S.  Cohen  [4]  account  qualitatively  for  much  of  the  observed  phenomena. 

Ultimately,  it  would  be  desirable  to  consider  the  more  complex 
problems  in  biochemical  pattern  formation  (i.  e.  , the  spatial  and  temporal 
structure  of  the  evolving  systems).  Experimental  observations  are  avail- 
able and  some  generally  accepted  mathematical  models  exist.  Of  particu- 
lar current  interest  are  certain  questions  concerning  biochemical  oscilla- 
tions (on  scales  ranging  from  a few  seconds  to  several  days).  A review 
containing  an  extensive  list  of  references  has  been  given  by  B.  Hess  and 
A.  Boiteux  [ 16]  . In  many  problems  the  chemical  kinetics  (for  example, 
the  rate  functions)  and  the  (possibly  nonlinear)  diffusion  coefficients  are 
not  completely  known,  but  their  general  forms  are  usually  known,  and 
from  this  it  is  possible  to  derive  much  of  the  qualitative  theory. 
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ABSTRACT,  The  potential  function  is  approximated  in  the  one  dimen- 
sional Schrodinger  equation  by  a step  function  with  an  arbitrary,  finite 
number  of  steps.  In  each  step  the  resulting  differential  equation  has 
constant  coefficients  and  is  integrated  exactly  in  terms  of  the  trigono- 
metric or  hyperbolic  functions.  The  solutions  are  then  matched  at  the 
interface  of  each  layer.  The  eigenfunction  is  then  constructed  over  the 
entire  domain.  This  numerical  method  has  certain  unique  features;  (a) 
the  potential  function  does  not  have  to  be  known  analytically;  (b)  for  a 
given  fixed  number  of  steps  in  the  potential  approximation,  all  the  eigen- 
functions and  eigenvalues  have  the  same  absolute  accuracy;  (c)  any  number 
of  eigenvalues  and  eigenfunctions  can  be  obtained  in  a single  computer  run 
without  any  need  to  guess  initial  eigenvalues;  (d)  for  a given  fixed  num- 
ber of  steps  in  the  potential  approximation  we  could  obtain  the  whole  in- 
finite spectrum  of  eigenvalues  and  eigenfunctions;  (e)  very  low  computation 
time  on  the  computer. 

1.  INTRODUCTION.  A numerical  method  is  presented  for  the  solution  of 
one  dimensional  Schrodinger  equation  which  is  rather  good  from  a practical 
(computation  time)  and  conceptual  point  of  view. 

The  potential  is  approximated  by  a step  function  with  an  arbitrary 
but  finite  number  of  steps.  In  each  step  the  resulting  differential 
equation  has  constant  coefficients  and  is  integrated  exactly  in  terms  of 
trigonometric  or  hyperbolic  functions.  The  solutions  are  then  matched  at 
the  interface  of  each  layer,  and  the  eigenfunction  is  then  constructed 
over  the  entire  domain.  A very  familiar  idea  in  Quantum  Mechanics  is  used 
in  the  development  of  the  numerical  method  which  is  the  matching  of  the 
Schrodinger  equation  solutions  and  their  derivatives  at  the  interface  of 
the  square  well  potentials  used  and  then  solving  for  the  constant  coeffi- 
cients. This  work  represents  the  implementation  and  testing  of  a numeri- 
cal algorithm  which  solves  the  Schrodinger  equation  for  a step  potential 
function  with  an  arbitrary  number  of  steps. 
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This  method  has  been  tested  on  several  problems  and  the  numerical 
results  are  very  good.  The  only  input  into  the  computer  program  is  a 
numerical  table  of  the  potential.  No  initial  estimates  of  the  eigen- 
values are  necessary.  The  computer  program  in  a single  pass  will  output 
any  desired  number  of  eigenvalues  and  the  corresponding  eigenfunctions  and 
their  nodes.  Computation  time  is  roughly  equal  for  the  eigenvalues  as  for 
the  eigenfunctions.  The  eigenvalues  are  computed  independently  from  the 
eigenfunctions  and  can  be  computed  only  by  themselves  thereby  cutting  com- 
putation time  in  half  if  only  the  eigenvalues  are  wanted. 

In  this  numerical  method  the  potential  is  approximated  by  a step 
function,  but  once  the  approximate  Schrodinger  equation  is  set  up,  it  is 
solved  exactly,  What  this  implies  is  that  all  eigenvalues  and  eigenfunc- 
tions are  all  of  the  same  accuracy.  The  reason  for  this  is  that  all  the 
eigenfunctions  are  exact  solutions  to  a given  Schrodinger  equation  (i.e., 
they  are  written  down  explicitly  in  terms  of  trigonometric  and  hyperbolic 
functions).  The  numerical  results  obtained  substantiate  this  expectation. 

In  more  conventional  methods  such  as  Rayleigh-Ritz  [11,  the  higher 
eigenvalues  and  eigenfunctions  are  not  as  accurate  as  the  fundamental 
eigenvalue  and  eigenfunction,  making  it  necessary  to  progressively  increase 
the  number  of  mesh  points  in  order  to  compute  higher  eigenvalues  and  eigen- 
functions. The  reason  for  this  is  that  the  eigenfunctions  are  themselves 
approximations  to  the  solutions  of  a given  Schrodinger  equation.  In  any 
one  of  these  conventional  approximation  methods,  as  the  higher  eigenfunctions 
oscillate  more  rapidly,  as  their  order  increases,  more  mesh  points  are  neces- 
sary to  compute  them  with  a given  accuracy.  This  does  not  occur  in  the 
method  presented  in  this  thesis. 

2,  STATEMENT  OF  THE  PROBLEM.  The  one  dimensional  Schrodinger  equa- 
tion is  written  in  dimensionless  form  as 


-A^y  + V(x)y  = Ey 


or 

(1)  - V(x)y  + Ey  = 0 , 


where  V(x)  is  the  potential  function  and  E is  the  energy  eigenvalue.  For 
central  field  problems  and  for  bound  states 

(a)  V(x)  is  infinite  at  the  origin  x = 0, 

(b)  V(x)  has  a negative  minimum  value  for  some  x = a, 

(c)  V(x)  approaches  zero  asymptotically  as  x ->  <». 

We  could  also  consider  problems  for  symmetric  potential  wells  with 
infinitely  high  walls,  because  this  method  is  quite  general  and  can  be 
applied  in  both  cases.  The  Schrodinger  equation  will  be  treated  as  a 
Sturm-Liouville  problem  (i.e.,  the  calculations  are  restricted  to  a finite 
domain),  and  therefore  the  character  of  the  potential  function  V(x)  is  of 


secondary  importance  with  respect  to  the  computations.  We  will  restrict 
Eq.  (1)  to  the  following  homogeneous  boundary  conditions 

(2)  y(0)  = y(L)  = 0 . 


The  boundary  conditions  (2)  are  rigorous  for  a potential  well  problem 
with  infinitely  high  walls;  for  a central  field  problem  it  is  necessary  to 
approximate  the  right  boundary  condition  such  that  the  eigenfunctions  remain 
finite  for  x by  taking  suitably  large  L in  (2)  [2]#  This  is  a good  approx- 
imation since  for  large  enough  L,  the  eigenfunctions  approach  infinitely 
small  values. 

3.  ANALYTICAL  ASPECTS.  We  begin  by  approximating  an  analytical 
potential  V(x)  such  as  in  Fig.  1,  if  available,  by  a step  function  in  a 
well  defined  way  which  need  not  be  specified  now,  in  the  following  way: 


(3) 


V]  0 < x < xj 
V2  X]_  < x < X2 


V (x)  ±J 


v3 


X2  < x < X3 


V x ..  < x < x = L 
L n n-1  n 


In  (3),  the  constant  values  V1,V2,...,Vn  are  step  function  approxi- 
mations of  an  analytical  potential  V (x) . The  potential  can  be  approximated 
with  arbitrary  accuracy  0(h),  if  a sufficiently  small  step  width  h = x^  - 
x^_j  is  utilized  [3].  Therefore,  our  approximate  problem  (3)  can  approach 
the  exact  problem  as  closely  as  we  desire,  solely  by  picking  the  desired 
step  widths  h.  The  analytical  form  of  the  potential  need  not  be  known  in 
order  to  apply  the  approximation  (3).  (See  Fig.  2.)  Once  a given  number 
of  steps  is  chosen,  the  approximate  problem  is  solved  exactly  in  terms  of 
elementary  trigonometric  and  hyperbolic  functions. 

In  each  layer  i,  the  approximate  problem  becomes 

2. 

(4)  + (E  - V±)y  = 0 , i = 1,2, ...  ,n  . 


We  shall  define 


(5) 


= E - 


V. 

l 


b?  = 

1 


so  that  the  solution  to  (4)  is 


(6) 


J 


y = A.F(b.x)  + B.G(b.x) 

X I X X 


1 2,3,«.*,n  ” 1 


where  F and  G are  given  in  Table  I,  and  A.  and  B.  are  integration  con- 
stants. 

Table  I 

Solutions  of  Equation  (4) 

a.  < 0 (Forbidden  Region)  a.  = 0 a.  > 0 (Allowed  Region) 

l l l 

F(b^x)  cosh(b^x)  1 cos(b^x) 

G(b.x)  bT1  sinh(b.x)  x bj1  sin(b.x) 

l l i i i 


These  solutions  exhibit  the  well-known  fact  that  the  Schrodinger 
equation  solutions  are  oscillatory  within  the  region  defined  by  the  two 
turning  points  and  exponential  outside.  In  central  field  problems,  the 
classical  turning  points  are  defined  by 

(7)  a,  = E - V.  = 0 

l l 

where  the  total  energy  equals  the  potential  energy.  Classically  this  is 
the  point  at  which  the  incident  kinetic  energy  of  the  incident  particle 
equals  its  potential  energy,  and  therefore  the  point  at  which  the  kinetic 
energy  of  the  particle  is  zero.  It  will,  at  the  next  instant,  change  the 
direction  of  its  motion;  therefore  these  points,  at  which  E = V.,  are 
referred  to  as  "classical  turning  points."  It  is  interesting  to  note  here 
that  other  approximations,  namely  the  W.K.B.J.*  approximation,  fail  in  the 
neighborhood  of  turning  points  and  require  special  consideration  [4].  Any 
one  of  a number  of  textbooks  on  Quantum  Mechanics  will  cover  the  W.K.B.J. 
approximation  [5].  In  summary,  the  W.K.B.J.  method  is  a short  wave  length 
approximation  technique  used  for  solving  the  Schrodinger  Eq.  (1).  As  the 
wavelength  decreases  the  variation  of  the  potential  V(x)  over  one  wave- 
length becomes  smaller.  The  approximation  is  then  made  that  in  the  limit 
we  may  consider  the  potential  V(x)  as  a constant  for  several  wavelengths 
about  x and  that  in  this  region  the  momentum  is 

(8)  p(x)  = /2m (E  - V(x))  . 

The  corresponding  approximate  solution  of  Eq.  (1)  is  then  given  by 
the  plane  wave  solution 

(9)  y = exp(±  i/h  f p dx) 


*The  letters  W.K.B.J.  stand  for  G.  Wentzel,  H.  A.  Kramers,  L.  Brillouin, 
and  H.  Jeffreys,  who  more  or  less  independently  rediscovered  the  procedure 
in  connection  with  the  solution  of  different  problems. 
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The  nquasi-classicaln  condition  that  the  wavelength  of  the  particle 
vary  slowly  over  distances  of  the  order  of  itself  is  written 


(10) 


d (A/ 2tt) 
dx 


where  A(x)  = 2irh/p(x)  is  the  de  Broglie  wavelength  of  the  particle  and 


£ = k = J M4I) 

h T h2 


is  the  wave  number. 

Condition  (10)  can  be  rewritten  as 


(11)  (h(2m(E  - V))  1/2) 


| (2m(E  - V)"3/2  (2m (E  - V)) 


mh 


(2m(E  - V)) 


dV (x) 

3/2  dx 


« 1 


It  can  readily  be  seen  that  the  solution  (9)  will  fail  at  those 
points  (the  turning  points)  at  which  E = V(x),  that  is,  the  zeros  of 
Eq.  (11).  This  problem  is  not  encountered  in  our  method  of  solution. 

The  solution  (6)  in  regions  1 and  n are  given  respectively  by 
(12)  y = A1F(b1x)  + B^O^x)  , y = A F(b  x)  + B^GOyO  . 


Now  applying  boundary  conditions  (2)  to  these  solutions,  we  obtain 


(13) 


y = BiGO^x)  , y = Bytb^x  - L)] 


since  in  neither  region  1 or  n is  the  function  F equal  to  zero,  which  in 
turn  implies  that 


At  = A = 0 . 

1 n 

The  function  G is  defined  in  Table  I,  and  Bi  and  B are  integration 

n 

constants. 

We  are  now  left  with  the  straightforward  task  of  determining  the 
integration  constants  k±  and  B±.  This  is  done  by  matching  the  solutions 
(6)  and  its  derivatives  at  the  interfaces,  namely 
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(14) 


yi+i » 


y ? = y ? 

>i+l 


The  general  equations  for  matching  the  solutions  and  its  derivatives 
at  the  ith  interface  are 


(15) 


A.F(b,x.)  + B.G(b.x. ) = A nF(b  1x. ) + B.  _G(b.  .x. ) » 

ill  111  l+l  l+l  l l+l  l+l  l 

A.F' (B.x.)  + B.G' (b.x.)  = A.  F' (b. , _x.)  + B. , .G* (b.  -X.) 
l ii  l ii  l+l  x+1  i i+l  l+l  i 


At  the  first  interface,  which  involves  boundary  region  1 in  which 
Aj  = 0,  the  first  matching  pair  of  solution  and  derivative  equations 
become 


(16) 


BjG(b3x3)  = A2F(b2x3)  + B2G(b2x3)  , 

BiG'O^xO  = A2F’(b2x1)  + B2GJ(b2x1)  . 


At  the  second  interface  the  matching  pair  of  solution  and  derivative 
equations  are 


(17) 


A2F (b2x2)  + B2G(b2x2)  = A3F (b3x2)  + B3G(b3x2)  , 

A2F' (b2x2)  + B2G' (b2x2)  = A3F* (b3x2)  + B3G’ (b3x2) 


This  matching  process  is  performed  at  all  interfaces  until  reaching 
the  interface  at  Xp_j  which  involves  boundary  region  n in  which  An  = 0. 
Therefore,  the  final  pair  of  equations  matching  solution  and  derivative 
at  the  rightmost  interface  xn_^  are 


(18) 


A , F(b  , ) + B , G(b  ..  x .)  = B G[b  (x  . - L)  ] 
n-1  n-1  n-1  n-1  n-1  n n n-1 

A ,F'(b  nx  ) + B G*  (b  ..x  ., ) = B G*  [b  (x  - L)  ] 

n-1  n-1  n-1  n-1  n-1  n-1  n n n-1 


Rewriting  this  system  of  equations  in  a homogeneous  form,  we  obtain 


BjGCbjX^)  - A2F(b2x3)  - B2G(b2Xj)  = 0 
BjG!(bjXj)  — A2F! (b2x3 ) — B2G* (b2x3)  ~ 0 

A2F(b2x2)  + B2G(b2x2)  - A3F(b3x2)  - B3G(b3x2)  = 0 
A2F * (b2x2)  + B2Gf(b2x2)  - A3F’(b3x2)  - B3Gf (b3x2)  = 0 

(19) 


A , F(b  ,x  .)  + B _G(b  .x  ) - B G[b  (x  , 
n-1  n-1  n-1  n-1  n-1  n-1  n n n-1 


n-1 


F*  (b 


n-1  n-1 


) + B ^G'  (b tx , ) - B„G’[b_(x 


n-1  n-1 


n 


nv n-1 


- L)]  = 0 

- L)]  = 0 
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V ( x ) 


; 

V = Vq 

l 

¥1 

T 

2 

boundary  conditions  at  x - 


(1)  Y-l(x=x1)  » v2(x=x1) 


(2)  4l'(x=x1)  * '1'2^x=xi') 


where  : 


. . . ikx  -ikx 

4*  ^(x)  = Ae  + Be 


. v „ ikx  , n -ikx 
Y 2(x)  = Ce  + De 


A - V0  . k.'  - /V0  - E 


Figure  3.  Typical  barrier  penetration  pro:  am 
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In  (19)  the  primes  designate  the  derivates  of  the  functions  eval- 
uated at  the  interfaces.  System  (19)  is  a homogeneous  system  involving 
2n  - 2 equations  and  2n  - 2 unknowns  (A-^,  i = 2,3,,.,,n  - 1;  B^,  i = 

1, 2,3, . . . ,n) . In  order  to  solve  for  non-trivial  solution  of  the  unknowns 
we  must  require  that  the  determinant  of  the  coefficients  of  system  (19) 
be  identically  zero.  That  is, 


A 


(20) 


G^jXj)  - F(b2xj)  - G(b2xi) 

G'Onx!)  - F'O^Xi)  - G'(lxi) 

F(b2x2)  G(b2x2)  - F(b3x2)  - G(b3x2) 

F’(b2x2)  Gf (b2x2)  - F' (b3x2)  - G'(b3x2) 


r<VlVl>  G(bn-lVl)  - G[V*n-l  * L)  1 
r'<VlVl>  G,(bn-lVl>  - G'tbn<Vl  - 1)1 


= 0 . 


The  zeros  of  this  determinant  equation  are  the  eigenvalues  of  the 
approximate  problem,  (1),  (2)  and  (3).  For  each  eigenvalue,  there  is  a 
non-trivial  solution  for  A^  and  which  in  turn  defines  its  correspond- 
ing eigenfunction.  From  this  point  on,  | A | will  be  looked  upon  as  a 
function  of  a real  variable  E,  namely  f(E)  = |a| . 

At  this  point  we  shall  poinc  out  differences  between  system  (19)  and 
those  obtained  by  the  variational  methods,  in  particular  the  Rayleigh- 
Ritz  method  [1],  In  the  latter  variational  method  one  obtains  a homogen- 
eous system  in  which  E is  a dependent  variable,  that  is,  an  algebraic 
system.  If  the  algebraic  system  is  of  order  n,  one  can  only  obtain  n 
eigenvalues  and  eigenfunctions.  Moreover,  there  is  no  definite  guarantee 
that  the  values  of  E obtained  in  this  method  will  be  the  exact  values,  as 
there  is  no  rigorous  establishment  of  convergence  in  the  Rayleigh-Ritz 
method.  On  the  other  hand,  the  homogeneous  system  (19)  that  we  derived 
is  a transcendental  system.  The  merits  of  this  system  are  obvious.  The 
determinant  Eq.  (20)  is  a transcendental  equation  and  will  always  have  an 
infinite  number  of  real  roots  (zeros).  As  will  be  seen  later,  one  can 
easily  define  an  iterative  process  whereby  we  can  span  a definite  energy 
range,  thereby  driving  the  determinant  equation  to  zero  at  the  real  eigen- 
values encountered  in  that  range.  As  with  any  trancendental  system,  the 
accuracy  desired  is  only  limited  by  a practical  consideration,  namely, 
computational  time.  The  fact  that  all  the  roots  are  real  is  guaranteed 
by  the  fact  that  (1) , (2)  and  approximation  (3)  form  a Sturm-Liouville 
system  [6].  Equation  (1)  is  a type  of  Liouville  equation  whose  differ- 
ential operator 

H - f^T  + E - V(x) 


(21) 


is  a hermitian  operator.  Indeed,  consider  two  eigenfunctions  (x)  and 
U2 (x)  being  operated  upon  by  H,  that  is 


(22) 


di_ 

dx2 


U1 


+ EjUi 


Vuj  = 0 , 


+ E2u2  - Vu2 


= 0 . 


Now  multiplying  on  the  left  the  equation  for  uj  by  u2  and  the  equa- 
tion for  u2  by  uj  and  subtracting: 

d2  d2 

(23)  u2  u-^  + U2(Ej  - V)uj  - u^  u2  - ui (E2  - V)u2  =0  , 

dx2  dx2 


or 

d2  d2 

(24)  u2  — uj  - uj  — u2  = u^E2u2  - u2E^u^  + u2Vu^  - uiVu2  =0  , 

dx2  dx2 


but,  since 

(25)  u2Vu^  - u^Vuj  = u2Vuj  - u2Vu^  = 0 , 


we  have 
(26) 


u2  Uj  - Uj  - 

dx2  dx2 


u2  = (E2  - E2)u2u2 


The  reason  we  obtain  so  simple  a relation  as  (26)  for  comparison  is 
because  the  modified  Liouville  Eq.  (1)  is  self-adjoint.  This  in  turn 
implies  that  H in  (21)  is  a hermitian  operator.  It  is  an  easy  task  to 
show  that  the  eigenvalues  of  a hermitian  operator  are  real.  Consider  the 
inner  product  of  the  two  eigenfunctions  u and  Hu: 

(27)  (u,Hu)  = (u,Eu)  = (E*u,u)  = E*(u,u) 

where  E is  the  eigenvalue  of  the  hermitian  operator  H,  and  E*  is  its  com- 
plex conjugate.  Since  H is  hermitian, 

(28)  (u,Hu)  = (Hu,u)  = (Eu,u)  = E(u,u) 

Subtracting  (28)  from  (27)  we  obtain 

(29)  E*(u,u)  - E(u,u)  = 0 . 

Since  (u,u)  = 0,  therefore 

(30)  E*  - E - 0 , 

which  implies  that  E is  real. 
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4.  THE  NUMERICAL  METHOD . In  this  section  we  shall  deal  primarily 
with  the  algebraic  manipulations  necessary  to  implement  a numerical  solu- 
tion of  Eq.  (19).  We  will  first  cover  the  matrix  algebra  analysis  of 
determinant  (20),  and  then  discuss  the  computation  of  the  eigenfunctions. 

A.  The  Eigenvalue  Equation.  Equation  (20),  from  now  on,  will  be 
referred  to  as  the  eigenvalue  equation.  We  have  already  stated  that  the 
determinant  equation  will  have  an  infinite  number  of  real  roots  that  are 
the  bound  state  eigenvalues  for  the  approximate  problem  (1),  (2)  and  (3). 
At  this  point  we  will  shift  the  elements  in  the  last  column  to  the  second 
column.  The  roots  of  (20)  are  not  affected  by  this.  By  doing  this,  (20) 
becomes 


G ( b x i ) *" F ( x i ) ~ G( b^x-j ) 


GOy,) 


(31) 


•F(b2x1 ) 

F(b2x2) 

F ( b2x2 ^ 


GribgXi ) 

G(b2x2) 
G ( brjXg  ) 


-Gi>n<vi  - o] 

-G[bn(Vi  - L>] 


”F(^3x2^  ™ ^ ( b3x2 ) 
~ G(b2X2) 


F(b 


n-Pn-l 


) G(bn-1  Vl^ 


Ffb  ,x  ) G(b  -j  x 1 ) 
x n-1  n-1  n-1  n-1' 


= 0 


We  will  group  the  elements  in  determinant  (31)  in  2 x 2 sub-matrices 
as  follows: 


(G(b-,x-j ) 0\  /-F(b2x1)-G(b2x1) 

^b]Xl)  0/  VF(b2x1)-G/(b2x1) 

*)  ( 

,0  0/  \F'(b2x2)  G{b2x?)j  \^-F'(b3x2)-Gtb3x2) 


F(b2x2)  G(b2x2)^  ^-F(b3x2)-G(b3x2)' 


(0  -G[bn(x„-r  L)])  1°  1 

lO  -G[bn(xn.rL)]  / Vo  0, 


(Q  0^ 


,0  OJ 


fo  cn 
ko  Oj 

'0  ov 

t0  Oj 


b(bn-lXn-l 

:(bn-lXn-l 


) G(b  .x 
' ; n-1  n-1 

1 G(b  i x -j 
v n-1  n-1 


(32) 


= 0 
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The  sub-matrices  in  the  first  and  last  row  of  first  column  of  deter- 
minant (32)  correspond  to  the  first  and  last  regions  specified  by  bj  and 
bn  and  the  first  and  last  interfaces  specified  by  X2  and  xn_2  ~ L«  All 
other  sub-matrices  of  determinant  (32)  are  of  the  same  form  and  refer  to 
a single  region  specified  by  b^,  i = 2,3,...,n  - 1 and  a single  interface 
specified  by  x^,  i = l,2,.„„»n  - 1„ 

We  shall  now  define  the  2x2  sub-matrices  in  determinant  (32)  as 


follows: 

/GCb^) 

-G[bn(Vl  ■ 

A11 

\g' (bjxj) 

o/ 

n-1,1  Vo  -G'[b  (x 

n n-1 

A.  . 

AVd 

ij 

\F  * (b . x . ) 
J 1 

G?(  bAxJ 

j 1 

(33) 

A zero  will  designate  the  null  2x2  sub-matrixo  Equation  (32)  can 
now  be  written  in  a more  compact  and  hand able  form,  namely; 


A 


11 


-A 


12 


0 


0 


A22  "A23 


0 


o o a33 


0 


0 0 0 
0 0 0 


An - 3 , n - 3 An-3,n-2 


0 


n-2,n-2 


0 

~An-2,n-l 


0 0 


An-1  ,n-l 


(34)  = 0 . 

Post  laultiplying  the  last  column  by  -A  \ .A  and  adding  the 

i , r*  -i  ^ . n-l.n-1  n~l*l 

result  to  the  first  column  we  obtain 
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Ah 


'n 


-A12  0 


A22  "A23 


0 A 


33 


A A"^  A 

n-2,n-lMn-l ,n-lMn-l ,1 


0 

0 

0 


0 

0 

0 


0 

0 


An-3,n-3  "An-3,n-2  0 


0 

0 


\i-2,n-2  "An-2,n-l 


Vl  ,n-l 


(35)  = 0 . 

Now  expanding  determinant  (35)  by  the  last  row 


A = 


Vl,n-l|  X 


A 

0 

0 


11 


0 

0 


.-1 


(36) 


-A 


12 


A 


22 


0 

0 


An-2,n-lAn-l  ,n-lAn-l  ,1  0 


0 

0 

0 


• • i 


0 

0 

0 


An-4,n-4  "An-4,n-3  0 


0 

0 

= o . 


An-3,n-3  "An-3,n-2 


0 


A 


n-2  ,n-2 


The  second  determinant  on  the  right-hand  side  of  Eq.  (36)  has  the  same 
form  as  the  original  determinant  (34).  We  now  repeat  the  process.  Post 
multiplying  the  last  column  of  the  second  determinant  on  the  right-hand 

side  of  (36)  by  "A^i2  n_2V2,n-lAnil,n-lAn-l,l  and  addin§  the  result  to 
the  first  column  we  obtain 
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An-1 ,n-l| X 


A 

0 

0 


11 


12 


22 


0 

0 

0 


An-3,n-2An-2,n-2An-2,n-lAn-l ,n-lAn-l ,1 
0 


0 

0 

0 


"An-4,n-3  0 
An-3,n-3  “An-3,n-2 


A, 


h-2  ,n-2 


(37) 


= 0 


Again,  expanding  the  second  determinant  in  Eq.  (37)  by  the  last  row, 
we  obtain 


A 


An-l,n-l  X An-2,n-2l  X 


A 

0 

0 

♦ 

< 

0 

0 


11 


A 


n-3,»-2rtn-2,n-2''n-2:,n-l'Vl  .n-V'n-l  ,1 


•1 


.A 


A. 


1 


,A 


-A 


12 


A, 

i 

0 

• 

0 

0 

0 


22 


0 

0 

0 


An-5ln-5"An-5,n-4  0 


0 

0 


An-4tn-4  "An-4 ,n-3 


0 


A 


n-3,n-3 


(38) 


= 0 


The  third  determinant  on  the  right-hand  side  of  Eq.  (38)  has  the  same 
form  as  the  original  determinant  (34) . It  is  evident  that  we  can  repeat 
the  process  recursively  n-2  times  to  reduce  the  original  eigenvalue  Eq.  (34) 
to  the  form 
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= 0 . 


H " IVl,n-llXlAn-25n~2lXIV3,n-3lX  *"  XIA22 


(39) 


lAll  + A12A22A23A33A34A44 


From  Eq.  (33)  and  Table  I it  is  evident  that 


(40) 


x 2^3^««e^n  “ 1 « 


Therefore,  Eq®  (39)  reduces  to 
(41)  | A j = |AU  + a12A22A23A33A34A44 


We  have,  thus,  reduced  the  evaluation  of  the  determinant  of  a 
(2n  ~ 2)  x (2n  - 2)  matrix  in  (20)  to  that  of  a 2 x 2 matrix  on  the  right 
side  of  (41). 

Consider  a bound  state  eigenvalue  corresponding  to  an  eigenfunction 
with  two  turning  points.  In  Fig.  (4)  the  turning  points  are  indicated  at 
t^  and  t2.  In  the  classically  forbidden  regions,  the  eigenfunction  behaves 
exponentially,  that  is 

(42)  a.  H E - V.  < 0 . 

i i 


By  Table  I,  the  eigenfunctions  in  these  forbidden  regions  are 
expressed  in  terms  of  hyperbolic  functions,  reflecting  their  exponential 
behaviour.  At  this  point  it  is  necessary  to  point  out  that  we  would  run 
into  formidable  numerical  problems  if  we  were  to  evaluate  Eq.  (41)  directly 
using  (33).  This  is  because  in  the  classically  forbidden  region  the  eigen- 
function might  be  several  orders  of  magnitude  smaller  than  in  the  allowed 
region.  In  numerical  methods  these  problems  are  known  as  scaling  problems 
[7],  or  as  asymptotic  problems.  The  reasons  can  be  understood  by  further 
examination  of  Eq.  (41) 0 Consider  the  nonsingular  matrix  in  (41)  furthest 
to  the  right  of  the  turning  point;  see  also  Eq.  (33)  and  Table  I. 


7cosh  b t,x  „ 
n-1  n-Z 


sinh(b  ,x  )/b 

n-1  n-2  n-1 


(43) 


n-2 , n-1 


b ,sinh  b ,x  „ cosh  b ,x  „ 
n-I  n— 1 n-Z  n— 1 n-Z 


In  quantum  mechanics  problems,  the  arguments  of  the  hyperbolic  func- 
tions in  (43)  can  grow  quite  big,  say  in  the  neighborhood  of  100.  We 
would  be  in  trouble  because 

(44)  sinh  100  = cosh  100  ■ e100/2  . 
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When  numerically  multiplying  the  2x2  matrices  in  (41) , while 
evaluating  the  determinant  A = f(E),  differences  such  as  sinh  100  - 
cosh  100  must  be  evaluated  numerically  and  the  computer  returns  them  as 
zero.  That  is,  the  direct  evaluation  of  A = f(E)  gives  the  result 

(45)  A E f (E)  = 0 , 


for  E in  the  range  of  interest.  We  can  remedy  this  problem  by  grouping 
the  matrices  in  (41)  except  the  two  singular  A-^  and  An_^  whose  argu- 
ments are  b^x^  and  tn(xn_^  - L)  respectively  (see  Eqs.  (3$)),  as  follows. 

By  performing  matrix  multiplications  and  using  elementary  addition 
formulas  for  the  circular  and  hyperbolic  functions  we  shall  derive 
expressions  for  the  matrix  product  A^_^  in  the  classically  forbidden 

region  outside  the  turning  points  expressed  by 


(46) 


a.  = E - V.  < 0 , 

l l 


and  in  the  allowed  region  inside  the  turning  points  expressed  by  (see 
Fig.  4)) 


(47) 


a.  E E - V.  > 0 . 

l l 


Using  (33)  we  obtain  the  following  matrices: 
A. 


li 


(48) 


A<biV 

G^iXi^ 

G'Gu.x.), 

AVi-i* 

G (b  . x . 

l l- 

\F' (bix±_1 

) G'(b.x. 
1 1' 

The  determinant  of  A^.  is  given  by 


(49) 


| A | = 1 , i = 2,3 n - 1 . 


The  inverse  of  A^  is  given  by 


-i  j0' (b±v  -F' (bixi)  \!_i_ 

11  \-G(b.x.)  F (b  . x . ) / A 11  ^ 

i l li 
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Forbidden 
Region  E < V. 


(50) 


ci  , /G' <V±>  -F' <biV\T. 

11  \-G(b  x ) F(b  x.)  / 

11  xi 


-1  . 


The  matrix  product  A,  ..  .A. . is  given  by 

i-l,i  li  J 


L A-i  = /F(biVi>  G(biVi)\/G,(bixi)  ■p,Cbixi)\T> 

1-1,1 11  \F'(Vi-i)  G'(bixi-i)A-G(Vi)  F(Vi}  / 

C(b±xi_1)  G'(b.x.)  - G(b.x._1)F- (b.x.)  -F(b1x._1 

' (biXi_i)G' (biX±)  - GT(bix1_i)F,(bixi)  -F(b  x L 


)G(b.x.)  + G(b . x . -)F(b.x.) 

li  l l-l  l l 

) G (b . x . ) + Gf (b.x.  -)F(b.x. 

li  l l-l  l l 


(51) 

For  the  classically  forbidden  region  outside  the  turning  points  (51) 
becomes 


-1 


/cosh(bi(xi  - x^_p)  -b^  sinh(b^(x^  - x_^  ^)) 


» -L  I -LX  X"" X 

i.  , .A..  =( 

1 * 11  \-b.sinh(b. (x.  - x 

' l li  l-l 


))  cosh(b^(x^  - x p) 


(52) 


And  for  the  allowed  region  inside  the  turning  points  (51)  becomes 


(53)  A. 


, / cos (b  . (x.  - x.  ))  -blsin(b.  (x.  - x.  _))\ 

. , ,a:1  - ( 111-1  1 1 1 l-l  ) „ 

i ’i  ii  \b^sin(b^(x^  - xi_1))  cos(b^(x^  - x^_p)  j 


This  simple  manipulation  has,  thus,  eliminated  in  one  stroke  all 
the  scaling  difficulties  connected  with  the  eigenvalue  equation,  because 
the  arguments  of  the  hyperbolic  functions  have  been  reduced  by  at  least 
two  orders  of  magnitude  with  respect  to  (43).  For  example,  if  the  poten- 
tial is  approximated  by  50  steps,  then  the  step  width  - x-[_i  = h < L/50. 
The  simple,  analytical  elimination  of  the  scaling  difficulties  before  com- 
putation has  obvious  advantages  over  a numerical  treatment  during  compu- 
tation of  the  eigenvalues. 

This  section  is  concluded  with  a description  of  the  numerical  method 
used  to  find  the  roots  of  the  determinant  Eq.  (41),  whose  matrices  have 
been  grouped  as  shown  in  (52)  and  (53) 0 These  roots  are  approximations 
to  the  eigenvalues  for  the  bound  states.  It  will  be  useful  to  think  of 
A in  (41)  as  a function  of  a real  variable,  f(E), 

(54)  f(E)  = |A|  , 
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whose  zeros  will  be  determined  numerically.  The  eigenvalue  search  is 
facilitated  by  the  fact  that  all  eigenvalues  are  bounded  from  below  by 
the  minimum  of  the  potential  [8], 


(55) 


E > V . 

n - mm 


In  (asymptotic)  quantum  mechanics  problems  with  deep  potential  wells, 
the  fundamental  energy  eigenvalue,  E , approaches  asymptotically  the 
potential  minimum  [9], 


(56) 


E + V . 
o min 


A computer  subroutine  EIGEN  was  written  that  computes  the  function 
f (E)  in  a predetermined  number  of  integral  points  in  a range 


(57) 


V . < E < V , , 

mm  - - right 


where  Vright  is  chosen  arbitrarily  and  is  to  be  sufficiently  to  the  right 
of  Vmin  depending  on  how  many  eigenvalues  are  desired.  Integral  values 
for  Vw in  and  are  chosen.  EIGEN  computes  f (E)  for  all  integral 

values  of  E in  the  range  (57).  Whenever  a change  in  sign  of  the  function 
f(E)  at  two  successive  (integral)  values  of  E,  it  will  store  those  two 
values,  and  proceeds  until  it  encounters  another  sign  change,  whereupon, 
it  will  store  the  two  successive  (integral)  values  of  E at  which  the  sign 
change  occurs,  and  so  on.  This  process  is  continued  until  the  entire 
range  (57)  of  E is  scanned. 

Having  stored  all  integral  intervals  of  E for  which  a sign  change  for 
f(E)  occurred,  the  subroutine  EIGEN  will  then  subdivide  these  intervals 
into  ten  increments  of  0.1  and  re-scan  each  of  the  intervals  for  a change 
in  sign  of  the  function  f(E)  at  two  successive  points.  Each  one  of  these 
integral  intervals  will  contain  two  successive  points  for  which  the  func- 
tion f (E)  will  change  in  sign.  This  latter  set  of  pairs  of  successive 
points  is  stored  by  EIGEN.  Again  this  last  set  of  intervals  is  subdivided 
into  ten  parts  of  0.01  and  the  process  is  repeated. 

It  should  now  be  obvious  that  this  process  can  be  continued  indefin- 
itely until  the  desired  accuracy  of  the  eigenvalues  is  reached.  For 
example,  if  four  iterations  are  performed,  the  location  of  the  eigen- 
values is  ascertained  with  three  decimal  accuracy  (see  the  computer  out- 
put for  further  examples).  The  search  is  terminated  when  the  range  (57) 
is  scanned  completely  or  when  a predetermined  number  of  eigenvalues  has 
been  found. 

B.  COMPUTATION  OF  THE  EIGENFUNCTIONS.  It  has  been  found  convenient 
to  rewrite  system  (19)  as 
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12C2 

= 0 

22C2 

" A23C3 

= 0 

A33C3 

- v* 

= 0 

A4464  ' \5S5 

= 0 

A 1 1C1 
n-1,1  1 


(58) 


A 0 _c  -A„  ,c  = 0 
n-2,n-2  n-2  n-2,n-l  n-1 

+ A . .c"  = 0 

n-1, n-1  n-1 


where  Aij  are  the  2x2  matrices  given  by  (33)  and  c.  are  the  following 
2x1  vectors:  x 


(59) 


'■(y-'-o 


, x 2,3, ... ,n  1 . 


The  components  of  the  vectors  c^  are  the  coefficients  of  the  eigen- 
functions in  the  boundary  regions  1 and  n (see  Eq.(l3))0  The  first  equa- 
tion in  (58)  represents  the  first  two  equations  in  system  (19) , that  is 
(using  Eqs.  (33)  and  (59)): 


(60) 


'G(Vl>  °\  /M  _ /F(b2xl)  G(Vl>\  /A2\  . /°\  _ 

.GCbjX^  0/  \eJ  Wb^)  GO.f-^1  \B,/  \o/  ’ 


or,  performing  the  matrix  multiplication 


(61) 


/BlG<blXl>  'V^lV  - B2G(b2xn  = /0  V . 
'kBlG(blXl)  "A2F(blXl)  " B2G(b2Xl}' 


Equation  (61)  represents  the  first  two  equations  of  system  (19)  in 
vector  form. 

The  components  of  the  other  vector  ci,  i ^ 1,  are  the  coefficients 
of  the  eigenfunctions  in  the  inner  regions  (see  Eq.  (6)).  Once  the 
eigenvalues  have  been  determined,  the  matrices  Aj_j  (Eqs,  (33)  and  (50)) 
are  explicitly  determined,  A direct  numerical  solution  of  (58)  would 
fail  because  we  would  encounter  scaling  problems  of  a nature  mentioned 
before  in  the  computation  of  the  eigenvalue  Eq.  (41) o These  scaling 
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problems  occur  while  performing  numerical  solutions  of  most  quantum 
mechanics  problems.  In  what  follows,  it  shall  be  demonstrated  that  a 
direct  numerical  solution  of  (58)  would  introduce  formidable  numerical 
and  computational  problems. 

Let  us  determine  the  coefficients  of  the  eigenfunction  at  the  inner 
region  (n  - 1)  furthest  to  the  right,  which  is  assumed  to  be  outside  the 
turning  points  t^  and  t£  in  the  classically  forbidden  region  (see  Fig.  5). 

From  the  last  equation  in  (58),  we  get 


or 

(62) 


^n-l,n-l^n-l  An-l,l^l  ’ 

— ^ , “1  * — ^ 

c = -A  , ,A  . -c. 

n-1  n-l,n-l  n-1,1  1 


Let  us  assume  that  the  eigenfunction  is  symmetric 


(63) 


y(x)  = y(-x)  , 


for  x in  all  regions. 

Using  (63),  Eq.  (13)  and  Table  I,  we  have 

(64)  B > 0 , B <0  . 

1 n 


Since  the  eigenfunctions  in  these  regions  are  uniquely  determined 
except  by  a constant  factor,  we  set 


(65) 


B = 1 , B - -1  . 

1 n 


Thus, 

(66) 


■0-0 


Multiplying  ^ ^ by  c^,  we  get 

^0  -G[b  (x 
r ~ 

,0  -G[b 


, . __r  -^vvi-^n/n 

n"1,lCl  \o  -G[bn(xn_1  - L)]/\-l/ 

, /G[VvrL)]\ 

n_1}lCl  \G[bn(xn_1  - L)]/ 


(67) 


Now  multiplying  A 


n-1, n-1 


by  (67),  we  get 


(cosh(b  .x  .) 
n-1  n-1 


bn-l3l“h(bn-lVl)\ /G[ln(Xn-l  ‘ L)  > 


"n-1 


i b , sinh (b  _x  .)  cosh(b  ,x  ) 
n-1  n-1  n-1  n-1  n-1 


G[b  (x  i - L)] 
n n-i 


^cosh(b  ,x  .) 
n-1  n-1 


-1 


-bn-iSinh(bn_iVl)\  /^Wl  ’ L)  ] 


n-1 

(68) 


-b  sinh(b  x ) coshO>  •, ) 

n-I  n-I  n-1  n-l  n-1 


tG[b  (x  - L)] 
n n-i 


The  arguments  of  the  hyperbolic  functions  in  the  2x1  vector  in 
(68)  are  relatively  small,  while  those  in  the  2x2  matrix  of  (68)  are 
relatively  large,  typically  in  excess  of  100.  There  is  now  no  numerical 
difference  between  the  hyperbolic  sine  and  cosine  in  the  matrix  multipli- 
cation in  (68) . 


Performing  the  matrix  multiplication  in  (68)  we  obtain  the  coeffi- 
cients An_^  and  Bn_^  for  the  eigenfunctions  in  region  n-1.  Using  Eq.  (6) 
we  obtain  the  eigenfunction  in  region  n-1, 


(69)  y(x)  = B(cosh  b - x - sinh  b , x)  = 0 , xe(x  , ,x  = L) 

n-1  n-1  n-l  n 

where  B is  a constant.  Notice  that  for  bn-1  x > 100  there  is  no  numeri- 
cal difference  between  the  two  hyperbolic  functions  in  (69).  Therefore, 
the  computation  of  the  eigenfunction  in  this  region  n-1  would  yield  zero 
identically. 

This  problem  can  be  solved  analytically,  as  was  done  before  with  the 
eigenvalue  Eq.  (41),  by  using  elementary  addition  formulas  for  the  circu- 
lar and  hyperbolic  functions. 

Rewriting  the  first  n - 2 equations  of  system  (58)  as: 


c n = A - .A  „c  • 

n-1  n-2,n-l  n-2,n-2  n-2 


(70) 


c„  = 


-1.  -v 
A12A11C1 


c„  = 


A-1A 

A23A22°2 


A ^A  c" 
34  33  3 


-l 
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It  is  obvious  that  we  can  now  write  (integrate)  system  (70)  in  terms 
of  the  vector  c^  as  follows: 


(71) 


“I.  -A 

C2  = A12A11C1  » 

C3  = A23A22A12A11C1  * 

c4  = A34A33A23A22A12A11C1  ’ 


c 

m 


A12A11C1 


The  last  equation  in  (71),  for  m * n - 1,  determine  cn_]_  in  terms 
of  c^,  and  the  last  equation  of  (58)  also  express  cn_^  in  terms  of  c^. 
From  these  two  equations  we  could  obtain  the  last  integration  constant 
Bn  which  appears  in  "c^  to  completely  determine  the  problem. 

For  example,  from  the  last  equation  in  (71) 


(72) 


Dc, 


J 


-X 

where  D is  a 2 x 2 matrix  designating  the  product  ,m^’  * 

and  from  (56)  we  get 

(73)  ' 

where  E is  a 2 x 2 matrix  designating  the  product  n-l^n-1  \* 

Hence,  subtracting  the  two  vector  equations  we  get  * * 


(74) 


or 

(75) 


(D  - E)cx 


» 


iB, 

+ a_B  » 

0 

X X 

2 n 

oB, 

+ a.B  * 

0 

3 1 

4 n 

Since  B1  is  an  arbitrary  integration  constant,  let 
(76)  B1  = 1 , ■>  -a1/a2  „ 
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Where  a-^,  i = 1,...,4  designate  the  matrix  elements  of  the  2x2 
matrix  difference  (D  - E) . 

This  cannot  be  readily  done  because  of,  again,  scaling  (asymptotic) 
problems.  That  is,  numerical  errors  would  accumulate  and  propagate  if 
(58)  were  solved  from  top  to  bottom;  that  is  the  same  as  starting  with 
an  arbitrary  value  of  the  eigenfunction  at  the  left  boundary  and  integrat- 
ing all  the  way  to  the  right  boundary.  To  avoid  this  difficulty,  the 
solution  of  (58)  as  given  by  (71)  is  stopped  at  m = n/2,  i.e.,  the  mid- 
point of  the  domain,  which  is  always  taken  at  the  minimum  (x  = a)  of  the 
potential  function.  In  symmetric  potential  problems  this  is  all  that  is 
needed,  since  the  eigenfunction  is  either  symmetric  or  antisymmetric  about 
this  point  (x  = a).  For  central  field  problems  with  asymmetric  potentials, 
after  computing  the  eigenfunction  from  the  left  to  the  midpoint,  (58)  is 
solved  via  (71)  from  the  bottom  till  the  midpoint,  i.e.,  the  eigenfunction 
is  now  computed  from  the  right  inward  to  the  midpoint,  and  then  both 
pieces  of  the  eigenfunction  are  matched  at  the  center.  The  eigenfunctions 
so  computed  on  either  side  of  the  point  x = a are  the  same  except  that 
they  differ  by  a constant  factor,  i.e.,  their  derivatives  match  at  the 
midpoint. 

The  matching  of  the  eigenvalues  is  done  by  multiplying  each  piece  by 
its  reciprocal  value  at  x = a,  so  that  both  pieces  also  have  the  same  mid- 
point value,  that  is. 


y(x)  = 


yL(a)  yL 


y (x)  , 0 < x < a , 


(77) 


y<x> " yR(x)  ’ a - x ' 


Except  for  the  first  matrix  on  the  right  side  of  (77),  the  others 
will  be  grouped  in  pairs,  as  follows: 


A. .AT1.  . 
xi  x-1,1 


(78) 


'F(biXP  6^blxl^ 

F’C^Xi)  G'  (bjXj 


(F(b.x.)  G(b.x. ) 

IX  XX 

tF' (bfXi)  G’ (b±Xi) 


F(biXi-l)  G<biXi-l> 
F’(biXi-l)  G'(biXi-l)j 


G’(b,Xj  ,)  -F*(b.x.  J 


-1 


x x-1 


X x-1 


-G(bA_i)  F(blVl) 


^(b.x.)  G(b . 


.F,(b.x.)  G 

l l 


(biXi)  \ /G,(biXi-l>  -G<bixi-1>\ 

f(bixi>/  VF,(bixi-i>  F(biVi> ) 


m / F(b  ixi)G'  (bixi-i)-F(bixi)G(bixi-i)  + G(bixi>F(bixi-i) 

F'  (bixi)GT  (b;Lxi_1)-Gf  (b^F*  (b^x^_1)-F*  (b^GO^x^)  + G’  (b^F^x^ 
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V(x) 

I 


Figure  6 


An  Eigenfunction  Discontinuity  at  the  Midpoint  Value  of  the 
Domain. 
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For  the  classically  forbidden  region  (78)  becomes, 


(79)  A1±A^ltl 


(coshfb^x^  - xi_1)  ] b^  sinhtb^x^  - 
[b^sinhfb^x^  - x^^)]  coshtb^x^  - vd  J 


For  the  allowed  region  (78)  becomes 
(80) 


-1  . 


, ,cosb  (x  - x ) b sinb  (x  - x .)' 

* \ x i i i x-l  x x i X“*x 

A A 1 ■ I 

11  * '-b.sinb  (x.  - x ) cosb.(x.  - x.  .) 

,i  i i i-1  i x x-l 


Notice  that  the  arguments  of  the  hyperbolic  functions  in  (79)  are 
two  orders  of  magnitude  smaller  than  if  the  matrices  were  computed  indi- 
vidually. Equations  (71)  will  now  be  written  as  follows: 


(81) 


'2 

'3 

A 

•> 

'4 


c 

m 


A12v2 

> 

A 1'*^’ 

A23V3 

9 

A34V4 

9 

• 

• 

-1 

A , 

V 

m-l,r 

ft  m 

V2"Vl  ’ 

v3  * A22A12A11C1  ’ 

V4  = A33A23A22A12A11C1  ’ 


.-1 


* , v , v * A , -A 

m-l,m  m ro  m-l,m-l  m-2,rn-l 


> * • A c 

Hi 


The  vectors  v-  in  (81)  are  evaluated  by  grouping  the  matrices  in 
pairs  as  shown  in  (79)  and  (80),  except  the  last  matrix  An  where  the 
arguments  of  the  hyperbolic  functions  are  b^x^.  As  indicated  before, 
the  vectors  involve  only  one  arbitrary  integration  constant  But 

we  are  still  in  numerical  trouble  since  the  inverse  matrices  Ajj  at  the 
left  in  (81)  still  involve  large  arguments  and  therefore  these  inverse 
matrices  are  numerically  unstable.  The  computation  of  the  coefficients 
still  present  numerical  difficulties. 

We  can  circumvent  this  problem  since  we  do  not  need  to  compute  the 
vectors  of  the  coefficients  c^  per  se.  After  the  2x1  vectors  v^  in 
(81)  have  been  computed  they  will  be  of  the  form 
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From  Eqs.  (6),  (59)  and  (81)  we  have  the  following  expression  for  the 
eigenfunctions  in  vector  form  or  notation: 


(83) 


y(x)  = c±  • f(x)  , 
-1 


y(x)  = A.i_1>ivi  • f(x)  , i = 2,3, . . ,n  - 1 


where  f(x)  is  the  vector  function 


(84) 


f(x)  = 


'f  (bjX)’ 

^(b^); 


Now  using  Eqs.  (13),  (82)  and  (84)  we  obtain  the  following  expression 
for  the  inner  product  in  (83) 


y(x)  = 


il 


12  J 


^(b.x)) 

[G(b±x) j 


y(x)  = 


^uG,(biV1)  -vi2G(bixi-iM  /F(bi^ 

^viiF’(bixi-i>  + vi2F(Vi-iy  lG(bix) 


y(x>  ' (vilG' <biXi-l)  ' Vi2G(biXi-l,’-VilF’(Vi-l) 


+ vi2r(b1x._1)) 


(?(b.x)N 


G(b±x) 


y(x)  = v_^G’  (b.x^_^)F(b^x)  - vi2G(bixi_1)F(bix) 


- V.-.F'  (b.x.  )G(b.x)  + v.0F(b.x.  )G(b.x)  , i = 2,3,...,n  - 1 

Xl  1 x-l  X 1Z  X 1-1  1 


(85) 


In  the  classically  forbidden  region  (85)  becomes 
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y«  - vu 

- V 

y(x)  = v±1 


cosh(b.x  )cosh(b.x)  - v b.^sinMb.x.  )cosh(b.x) 

X X““X  X XZ  X X X"™  X X 

,,b.sinh(b.x.  , )b.^sinh(b.x)  + v.„cosh(b.x.  , )b .^sinh(b .x) 
xl  1 x x— 1 x x x2  x x— 1 x x 

coshb.(x.  , - x)  - b.^v.„sinhb.  (x.  . - x)  , 
x 1-1  X i2  x l-l 


5 


(86) 

In  the  allowed  region  (85)  becomes 


in  region  i = 2,3,...,n  - 1. 


y(x)  =v.1cos(b.x.  )cos(b.x)  - v b. ^sin(b ,x._1 )cos (b .x) 

XX  X X“X  X XZ  X X x— X X 

- v...  (-b.sin(b.x._  ))b.1sin(b.x)  + v cos(b.x  )b.^sin(b.x)  , 


y(x)  = v cosb.(x.  - x)  - b/v  sinb,(x,  - x)  , 
li  1 1-1  1 iz  1 1-1 


in  region  i = 2,3,0*o,n  - 1 . 

(87) 

In  Eqs.  (86)  and  (87)  v.q  and  vi2  are  components  of  the  vector  v^ 
as  defined  in  (81)  and  (82). 

Having  been  able  to  use  the  elementary  addition  formulas  has  allowed 
us  to  analytically  perform  the  inner  product  in  (83) , thus  eliminating 
the  need  to  evaluate  hyperbolic  functions  with  very  large  arguments,,  It 
can  readily  be  seen  that  in  the  computation  of  the  eigenfunctions  in  (86) 
and  (87),  it  is  only  necessary  to  evaluate  the  vectors  v^  in  (81)  and  not 
the  coefficients  c^  in  (81) . The  computation  of  the  vectors  v^  does  not 
present  any  numerical  problems. 

5 . NUMERICAL  RESULTS . In  this  section  we  will  describe  some  of  the 
results  obtained  with  the  computer  subroutine  EIGEN.  First  we  dealt  with 
a central  field  Quantum  Mechanics  problem,  the  radial  Schrodinger  Eq.  (1) 
with  Morse's  potential  [10], 


(88)  V(x)  = D(1  - exp(-a(x  - x )))^  - D , 
where 

(89)  a = 0.711248  , x = 1.9975  , D = 188.4355  . 

e 

The  boundary  conditions  (2)  used  are 

(90)  y(0)  = y(10)  = 0 „ 
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The  second  problem  covered  was  that  of  Mathieu’s  Eq.  (11),  where 
the  fictitious  "potential"  is 

(91)  V(x)  = 2q  cos  2x 

The  boundary  conditions  (2)  used  are  in  this  case 

(92)  y(0)  - y(Tr)  - 0 . 

Although  Mathieufs  equation  is  not  a true  Quantum  Mechanics  problem, 

Eqs.  (1),  (91)  and  (92)  can  be  thought  of  as  the  bound  states  of  a parti- 

cle in  a box  of  length  tt  and  infinitely  high  walls  with  the  potential 
inside  the  box  given  by  (91). 

The  primary  reasons  why  these  two  problems  were  chosen  are  that 
MorseTs  potential  has  well  known  analytic  solutions  [12],  and  provides 
a good  check  for  the  numerical  solutions  of  the  eigenvalues;  and 
Mathieu's  equation  has  also  been  well  documented  [13]  and  is  a good 
check  for  the  numerical  results  obtained  for  the  nodes  of  the  eigen- 
functions. 


A,  Schrodinger Ts  Equation  with  Morse* s Potential.  Morse 1 s poten- 
tial was  approximated  by  a step  function  with  an  equal  number  of  steps 
m " n/2  in  the  ranges 


(93)  0 < x < 1.9975  , 1.9975  < x < 10  . 

The  interface  was  chosen  at  the  abscissa  of  the  minimum  value  of 
the  potential 

(94)  VU.9975)  = V . = -188.4355  . 

v min 

This  potential  varies  rapidly  in  the  neighborhood  of  its  minimum 
than  towards  the  right  boundary,  where  it  is  relatively  flat.  The  n/2 
potential  steps  in  the  right  range  of  (93)  were  taken  as  follows: 

n/4  steps  in  1.9975  < x < 4 , n/4  steps  in  4 < x < 10  . 

The  step  function  approximation  is 
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C V,  = (V(0)  + V(x, ))/2 
V2  - (V(x,)  + V(xz))/2 


0<  x<  x 


1 


< X<  X2 


V(x)  = < 


Vm  = -188.4355 
V1I]+1  = -188.4355 


Vl<x<xm“  1'9975  ; 


Vx<Vl 


^ni+2  ^xm+l^  + V^Xm+2^2  » xm+l<x<xm+2  : 


^ vn  = ^V(xn-l)  + v(xn))/2  » Vl<x<xn  = 1°  ; 
m = n/2  . 


(95) 


The  three  step  widths  used  are 

0 < x < 1.9975  , hx  = 1. 9975/ (n/2)  , 

1.9975  < x < 4 , h « (4  - 1.9975)/(n/4) ,, 

4 < x < 10  , h^  = (10  - 4)/(n/4) 

The  eigenvalues  were  obtained  by  searching  for  the  roots  of  the 
eigenvalue  Eq.  (20)  in  the  range 


(96) 


-188.0  < E < -108.0  . 


The  numerical  results  for  the  first  five  eigenvalues,  when  the  poten- 
tial is  approximated  by  n = 200  steps,  together  with  the  exact,  analytical 
results  are  given  in  Table  II.  Table  III  contains  the  nodes  of  these 
first  five  eigenfunctions. 


Table  II 

First  Five  Eigenvalues  of  Schrodinger' s Equation 
With  Morse's  Potential 


N 

200 

Exact 


E0 

-178.777 

-178.799 


E1 

-160.264 

-160.283 


E2 

-142.763 

-142.760 


e3 

-126o 275 
-126.288 


E4 

-110o796 

-110.809 
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Table  III 

Nodes  of  Morse's  Eigenfunctions 


I 

N11 

N12 

N13 

N14 

N15 

1 

0.000 

0.000 

0.000 

0.000 

0.000 

2 

1.888 

0.000 

0.000 

0.000 

0.000 

3 

1.768 

0.000 

0.000 

0.000 

0.000 

4 

1.688 

0o000 

0.000 

0.000 

0.000 

means  the  node  of  the  1^  eigenfunction. 

B.  Mathieu's  Equation.  The  Mathieu's  "potential"  is  approximated 
as  follows: 


The  numerical  results  for  q = 40  are  given  in  Tables  IV  and  V,  for 
n = 200,  together  with  the  exact,  analytical  results  taken  from  Ince's 
paper  [13]. 
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Table  IV 


First  Five  Eigenvalues  of  Schrodinger * s Equation 
With  Mathieus  Potential 


200  -67.595 

Exact  -67.606 


-43.342  -20.200 

-43.352  -20.208 


E 

3 

1.736 

1.730 


22.337 

22.332 


Table  V 

Nodes  of  Hathieu’s  Eigenfunctions 


I 

N11 

N12 

N13 

Nu 

N15 

1 

1.563 

2.757 

3.118 

0.000 

0.000 

2 

1.359 

1.767 

2.882 

0.000 

0.000 

3 

1.202 

1.563 

1.924 

3.071 

3.118 

4 

1.060 

1.406 

1.720 

2.066 

3.087 

N„  means  the  node  of  the  1^  eigenfunction. 
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